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In  the  main  body  of  the  report^  entitled  “Higher  Degree  Linear 
Approximations  of  Nonlinear  Systems,  xve^developr a  new  method  for 
obtaining  higher  degree  linear  approximations  of  a  certain  class  of 
nonlinear  „ control  systems.  The  standard  approach  in  the  analysis  and 
synthesis  of  nonlinear  systems  is  a  first  order  approximation  by  a  linear 
modely  This  is  usually  performed  by  obtaining  a  series  expansion  of  the 
systeni  at  some  nominal  operating  point  and  retaining  only  the  first  degree 
terms  in  the  series.  Obviously,  the  accuracy  of  this  approximation  depends 
on  how  far  the  system  moves  away  from  the  nominal  point,  and  on  the 
relative  magnitudes  of  the  higher  degree  terms  in  the  series  expansion.  In 
the  report„JW£  seeksim  approximation  for  a  nonlinear  system  by  a  linear 
model  up  to  higher  degrees  than  one.  This  is  achieved  by  finding  an 
appropriate  nonlinear  coordinate  transformation-nonlinear  feedback  pair  to 
perform  the  higher  degree  linearization.  With  the  proposed  method,  one 
can  improve  the  accuracy  of  the  approximation  up  to  arbitrarily  higher 
degrees,  provided  certain  solvability  conditions  are  satisfied.  The  Hunt-Su 
linearizability  theorem  makes  these  conditions  precise.  Our  approach  to  the 
solution  of  this  linearization  problem  is  similar  to  Poincare's  Normal  Form 
Theorem  in  formulation,  but  different  in  its  solution  method.  ^After  some 
mathematical  background  we  derive  a  set  of  equations  (called1  the 
Homological  Equations};based  on  the  goal  of  obtaining  a  model  accurate  to 
a  higher  degree  in  the  series  expansion.  A  solution  to  this  system  of  linear 
equations  is  equivalent  to  the  solution  to  the  problem  of  linearization  up  to 
higher  degrees  by  coordinate  change  and  feedback.  However,  it  is 
generally  not  possible  to  solve  the  system  of  equations  exactly.  We  outline 
a  method  for  systematically  finding  approximate  solutions  to.  these 
equations  using  singular  value  decomposition,  while  minimizing  an  error 


with  respect  to  some  defined  norm.  The  solution  thus  found  minimizes  the 
error  between  the  approximate  linearization  of  the  given  system  and  a 
“nearby”  one  that  is  exactly  linearizable  (in  the  sense  of  Hunt-Su)  up  to  the 
specified  degree  of  approximation.  We  present  a  computer  program 
written  in  the  MATLAB  language  that  automates  the  solution  of  the 
considerably  large  system  of  equations.  Finally,  we  demonstrate  the 
applications  of  the  method  and  the  efficiency  of  the  results  by  several 
examples  and  simulations. 

In  Appendix  A  we  demonstrate  the  usage  of  the  program  with  an 
example  session  recorded  during  running  MATLAB. 

Appendix  B  contains  the  additional  publications  supported  by  the 

grant. 


vi 


1.  INTRODUCTION 


1.1  Background 

The  state  space  approach  to  linear  control  systems  matured  into  a  well- 
defined  and  powerful  discipline  by  the  70's.  Contrary  to  linear  systems, 
nonlinear  control  systems  have  been  studied  with  many  different  methods 
of  approach,  usually  depending  on  the  type  of  nonlinearites  involved.  In 
this  framework,  many  tools  of  analysis  involving  qualitative,  quantitative 
and  computer-aided  methods  were  developed  as  diverse  as  perturbation 
methods,  limit  cycle  analysis,  describing  functions,  and  graphical  phase- 
portrait  methods  [5,6].  Most  of  these  approaches  were  suited  only  to  the 
specific  types  of  nonlinearities  for  which  they  were  developed.  Until  the 
mid-seventies,  a  coherent  theory  of  nonlinear  systems  did  not  seem 
possible  for  such  diverse  types  of  systems. 

However,  in  recent  years,  a  rich  theory  for  nonlinear  systems  has 
been  developed  using  differential  geometric  methods.  We  can  now  say  that 
a  theory  for  nonlinear  control  systems  exists.  In  fact,  the  differential 
geometric  setting  allows  the  generalization  of  many  known  classical  results 
in  linear  systems  theory  to  nonlinear  systems.  With  the  introduction  of 
differential  geometric  tools,  many  interesting  results  have  been  obtained 
for  nonlinear  controllability,  observability,  equivalence,  decomposition, 
optimality,  control  synthesis,  linearization  and  many  others.  -In  this 


broader  sense,  it  can  be  stated  that  linear  systems  are  a  special  case  of  the 
more  general  class  of  nonlinear  control  systems. 

The  first  major  developments  in  the  general  theory  for  nonlinear 
systems  were  through  the  introduction  of  differential  geometry  by 
Hermann  [15],  the  analysis  of  linear  multivariable  systems  using  a 
geometric  approach  by  Wonham  [46],  and  one  of  the  first  important  results 
in  the  transformation  of  a  nonlinear  system  into  a  linear  system  by  Krener 
[27,28].  We  refer  the  reader  to  Sussmann  [44]  for  an  excellent  survey  and 
bibliography. 


1.2  Motivation 

In  the  analysis  of  scientific  and  engineering  systems,  one  often  encounters 
situations  which  do  not  lend  themselves  to  exact  solutions  by  conventional 
methods.  The  assumption  of  linearity  in  most  control  system  models,  for 
example,  is  an  oversimplification  at  best.  This  assumption,  of  course, 
reflects  the  difficulties  one  would  rather  avoid  in  dealing  with  an  otherwise 
nonlinear  model.  Indeed,  one  can  seldom  find  a  technique  to  solve  a  given 
nonlinear  problem  exactly.  Since  the  control  system  designer  is  equipped 
with  powerful  methods  and  tools  for  attacking  linear  control  systems,  the 
motivation  for  “linearizing”  a  given  nonlinear  problem  is  clearly  very 
strong. 

Therefore,  whenever  possible,  the  control  problem  posed  must  be 
suitably  transformed  to  bring  it  into  an  appropriate  form  that  enables  the 
implementation  of  linear  control  design  techniques.  However,  the 


systematics  underlying  such  modifications  by  transformation  are  by  no 
means  self-evident.  The  simplest  of  these  modifications  is  a  first  degree 
linear  approximation  of  the  nonlinear  model  by  calculating  a  series 
expansion  at  a  nominal  operating  point.  The  validity  of  this  approximation 
depends  on  the  relative  size  of  the  higher  degree  terms.  In  systems  where 
nonlinearities  are  strong,  these  higher  degree  terms  cannot  be  neglected, 
and  the  approximation  fails. 

The  question  of  whether  a  nonlinear  system  can  be  equivalent  to  a 
linear  system  under  some  group  of  transformations  such  as  change  of 
coordinates  will  be  one  of  the  main  issues  of  this  report.  This  question  has 
been  addressed  by  many  researchers.  The  earliest  example  in  this  area  was 
solved  by  Poincare  [42],  who  gave  a  sufficient  condition  for  the 
linearizability  of  a  vector  field  around  a  critical  point  by  changing  state 
coordinates.  With  the  introduction  of  differential  geometric  techniques, 
the  method  of  linearizing  transformations  under  a  nonlinear  change  of  state 
coordinates  and  nonlinear  state  feedback  was  developed  by  various 
researchers. 

Krener  [27]  discussed  the  question  of  when  a  nonlinear  system  can  be 
transformed  into  a  linear  system  by  a  change  of  state  coordinates. 
Jakubczyk  and  Respondek  [25],  and  Hunt  and  Su  [17]  independently 
considered  the  full  state  feedback  and  coordinate  change  problem.  This 
problem,  solved  in  a  slightly  more  general  setting  by  Hunt  and  Su,  has  been 
since  coined  as  the  “Hunt-Su  Linearization  Method”,  and  it  is  one  of  the 
most  important  developments  in  the  field.  The  Hunt-Su  linearization 
method  gives  necessary  and  sufficient  conditions  for  there  to  exist  locally  a 
coordinate  transformation  and  feedback  that  carries  a  nonlinear  system  into 


a  linear  system.  In  [29],  Krener  considered  the  case  in  which  one  can  find 
an  approximate  linearization  by  considering  the  second  and  higher  degree 
terms  in  the  truncated  series  expansion  of  the  vector  field,  and  proved  a 
weakened  version  of  the  Hunt-Su  linearization  condition.  In  [35]  and  more 
recently  in  [34],  further  results  in  the  solution  of  the  resulting 
transformations  were  presented. 

Many  fine  applications  of  the  above  developments  have  since 
appeared  in  literature.  In  [38,39]  Meyer,  Su,  and  Hunt  have  successfully 
applied  these  techniques  to  automatic  flight  control.  Krener  in  [30]  has 
suggested  a  new  approach  to  compensator  design  in  the  same  framework. 
Freund  [13]  applied  these  methods  in  robot  control  even  before  the  theory 
was  fully  developed.  Other  applications  in  robotics  appeared  in  [11,45].  In 
[24],  Isidori  and  Ruberti  have  solved  the  input-output  linearization 
problem  for  a  system  with  output,  where  the  goal  is  to  find  a  state  feedback 
law  such  that  the  input-dependent  part  of  the  output  of  the  closed-loop 
system  is  linear  in  the  new  input. 

Important  theoretical  results  and  applications  were  also  developed  on 
the  dual  problem  of  nonlinear  observers.  In  order  to  construct  an  observer 
for  a  nonlinear  system,  a  suitable  coordinate  change  is  first  found  which 
transforms  the  system  into  an  observer  canonical  form.  Then  an  observer 
with  linearizable  error  dynamics  is  constructed  in  the  new  coordinates. 
This  approach  to  the  nonlinear  observer  problem  was  first  proposed 
independently  by  Krener  and  Isidori  [33]  and  Bes'le  and  Zeitz  [8].  Krener 
and  Respondek  in  [36]  extended  the  problem  to  multi-input  cases.  An 
application  of  this  method  appeared  in  [  12]. 


In  the  next  chapter  of  this  report,  the  Normal  Form  Theorem  of 
Poincare,  the  Hunt-Su  linearization  condition,  Krener's  results  on 
approximate  linearization,  and  the  results  of  [35]  will  be  reviewed  in  detail. 
These  topics  are  the  most  relevant  prior  developments  on  which  this  report 
is  based. 

The  objectives  of  this  report  are  to: 

(1)  Review  the  current  developments  and  the  mathematical 
background  necessary  to  establish  a  good  understanding  of  the  approximate 
linearization  of  nonlinear  control  systems.  Our  approach  here  will  be 
from  an  engineering  point  of  view,  and  explicit  reference  to  advanced 
mathematics  will  be  kept  to  a  minimum. 

(2)  Given  a  nonlinear  control  system  in  state-space  form  that  consists 
of  n  first-order  differential  equations,  find  a  general  solution  to  the 
problem  of  approximate  linearization  by  state  feedback  and  coordinate 
change.  Here  we  derive  the  set  of  linear  equations  from  the  Homological 
equations.  Solution  to  this  system  of  linear  equations  is  equivalent  to  the 
solution  of  the  linearization  problem. 

(3)  Present  an  efficient  method  of  solution  to  the  Generalized 
Homological  equations.  In  general,  the  solution  will  not  be  unique.  A 
solution  is  found  that  is  optimal  in  some  statistical  sense. 

(4)  Incorporate  the  method  of  solution  in  a  computer  program.  The 
derivation  and  the  solution  of  the  homological  equations  is  extremely 
tedious.  The  structure  of  the  equations  is  dependent  on  the  order  of  the 


system  being  linearized.  A  computer  program  that  automates  the 
procedure  has  been  written  in  the  MATLAB  program  language  . 

(5)  Illustrate  the  method  using  examples.  Using  the  computer 
program  that  solves  for  the  transformations  as  an  aid,  linearize  an  example 
control  system  up  to  second  degree  terms  in  its  series  approximation,  thus 
demonstrating  the  program  as  a  control  system  design  tool.  By 
simulations,  compare  the  response  of  a  control  system  that  has  been 
linearized  up  to  higher  degree  against  the  response  of  the  same  system 
linearized  only  up  to  first  degree. 

1.3  Outline  of  the  Report 

The  report  is  organized  as  follows: 

In  Chapter  2  some  mathematical  preliminaries  that  are  directly 
relevant  to  this  work  are  reviewed.  Various  mathematical  tools  such  as  Lie 
derivatives.  Lie  brackets,  and  distributions  are  introduced.  Controllable, 
controller,  observable  and  observer  normal  forms  for  linear  and  nonlinear 
systems  are  presented.  The  importance  of  the  nonlinear  controller  form  in 
our  work  is  emphasized.  The  Hunt-Su  linearization  condition  [17]  and  the 
extension  of  this  result  to  the  approximate  linearization  of  control  systems 
by  state  feedback  and  coordinate  change  [29]  are  explained.  The  results  of 
these  two  papers  are  central  to  the  report. 

In  Chapter  3  Poincare's  Normal  Form  Theory,  and  its  relevance  to 
higher  degree  approximations  of  control  systems  is  discussed,.  The 
problem  of  higher  degree  approximations  to  nonlinear  control  systems  is 
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stated,  and  references  [34,35],  which  consist  of  some  preliminary  work  of 
this  report,  are  reviewed.  The  Homological  equations  of  PoincarS  are  the 
starting  point  for  the  derivation  of  the  Generalized  Homological  Equations. 
A  solution  to  the  generalized  homological  equations  yields  the  equivalent 
solution  to  the  linearization  problem  with  coordinate  change  and  feedback. 
For  the  sake  of  analysis,  an  appropriate  basis  is  introduced  for  the 
expression  of  higher  degree  monomials  in  the  series  expansion  of  a  vector 
field.  The  generalized  homological  equations  are  evaluated  with  the  aid  of 
this  basis.  An  equivalent  system  of  linear  equations  to  be  solved  is 
obtained.  Properties  of  this  linear  mapping  are  discussed.  An  optimal 
solution  which  provides  the  best  approximation  in  some  statistical  sense  is 
presented  for  the  case  when  there  is  no  exact  solution  to  the  linearization 
problem. 

Chapter  4  presents  the  analysis  of  the  kernel  and  the  co-kemel  of  the 
linear  mapping  that  is  equivalent  to  the  generalized  homological  equations, 
and  derivation  of  the  linear  system  of  equations  that  is  equivalent  to  the 
homological  equations.  For  the  second-degree  linearization  problem  of  a 
single-input  control  system  it  is  shown  that  the  kernel  of  the  mapping  is 
always  of  dimension  one.  The  analysis  of  the  co-kemel  of  the  mapping  is 
far  more  complicated.  The  co-kemel  equations  are  derived  using  a 
formula  for  repeated  Lie  derivatives.  These  equations  represent  the 
relationships  that  have  to  be  satisfied  among  the  coefficients  of  the  second 
degree  terms  in  the  vector  field  in  order  the  system  to  be  exactly 
linearizable  (up  to  second  degree  for  the  case  we  have  analyzed).  Next,  the 
method  of  solution  for  the  system  of  linear  equations  derived  in  Ch.  3  is 
presented.  All  of  the  terms  that  appear  in  the  homological  equations  are 


expressed  in  a  suitable  basis.  These  expressions  enable  us  to  numerically 
calculate  the  coefficient  matrix  in  the  linear  system  of  equations,  which  is 
implemented  in  the  computer  program. 

In  Chapter  5,  the  computer  program  used  for  solving  the  equations  is 
explained  in  detail.  This  program  takes  the  first  and  second  degree  parts  of 
a  nonlinear  control  system  as  input,  and  solves  the  equivalent  set  of  linear 
equations  that  are  obtained  from  the  generalized  homological  equations.  It 
is  written  in  the  MATLAB  application  program  and  incorporates  into  the 
solution  of  the  problem  all  the  results  of  Chapters  3  and  4  in  an  efficient 
way.  Next,  various  nonlinear  control  systems  are  considered  as  examples 
to  demonstrate  the  efficiency  of  the  approximate  linearization  method.  The 
examples  chosen  are  control  systems  that  are  either  exactly  linearizable  up 
to  second  degree,  or  systems  that  yield  only  an  approximate  linearization. 
Comparisons  are  made  in  the  time-domain  responses  against  first-degree 
approximations  of  the  same  systems.  The  response  plots,  the  performance 
and  effectiveness  of  the  method,  and  their  significance  are  discussed. 

Finally,  in  Conclusion,  the  results  are  summarized  and  the 
significance  and  implications  of  this  study  in  control  system  science  are 
discussed.  Possible  future  research  topics  in  this  area  are  suggested. 

The  appendix  presents  a  sample  session  of  the  MATLAB  program 
for  Approximate  Linearization  of  Control  Systems  that  was  recorded 
during  running  the  program. 
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2.  MATHEMATICAL  BACKGROUND 


This  section  will  aim  to  clarify  the  connections  between  the  classical 
treatment  of  linear  control  systems  and  the  more  recent  results  in  nonlinear 
control  systems.  To  this  end,  we  will  introduce  some  mathematical 
definitions  and  results  that  are  not  traditionally  used  in  control  system 
analysis.  Our  central  focus  in  presenting  the  mathematical  background  will 
be  toward  a  precise  statement  of  the  linearization  problem  which  this 
report  addresses.  We  will  closely  follow  prior  work  by  Isidori  [21,22], 
Banks  17],  and  Krener  [27,29,30,32]. 


2.1  Preliminaries 

We  introduce  some  notations  and  definitions: 

R":  n-dimensional  Euclidian  space. 

M:  a  paracompact,  connected  C°°  manifold  of  dimension  n. 

V{M)\  the  real  linear  space  of  C°°  vector  fields  on  M. 

set  of  real-valued  functions  on  M. 

TX(M):  tangent  space  to  M  at  x  eM  (a  copy  of  R" ). 
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TM: 


the  tangent  bundle  over  M;  the  union  ^  Tx  M  of  tangent 


spaces. 


V  (M):  the  real  linear  space  of  C°°  one-forms  on  M,  dual  to  V. 


A: 


a  distribution  on  M.  A^  i\Tx  (Af),  V  ;c  g  Af 


The  differential  operator  d:  >V  (M)  is  defined  by 


dh  =  (dh/dx)dxl  +  —  +  (dh/dx)dxn 


for  h  g  C°°(M). 


For  any  one  form 

w  =  Wjdjtj  +  —  +  wndjcn, 


and  vector  field 


/  =  /i(  9/8*,)  +  -  +  /n(3/3x„) 


the  dual  product  <w,/>  is  defined  as  the  scalar  function 

Wi/i  +  -  +  wjn. 


Usually,  a  vector  fie  Id /g  V(M)  is  denoted  as  a  column  vector 

/=  (A  ,  •••■/„  )T 

jjf 

and  a  one  form  w  g  V  (M)  is  denoted  as  a  row  vector 
W  =  (w,,  w„). 

A  function  h  g  C°°(Af )  defines  a  one  form 
d/i  =  (3/i/3j c,,  — ,  3/i/3jc„). 
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Definition  2.1.1  Let  feV(M).  There  are  three  kinds  of  Lie 

derivatives  related  to/,  which  are  expressed  as  follows: 

(i)  The  Lie  (or  directional)  differentiation  of  a  scalar  function  h  with 
respect  to  /: 

h  €  C°°(M);  Lf  :  C°°(Af )  ->  C°°(M), 

n 

Lf(h)  =  <d h,f>  =  L  (x)  5— . 

This  is  more  precisely  the  derivative  of  the  function  h  in  the  direction  of 
the  vector/. 

(ii)  The  Lie  derivative  of  a  vector  field  g  with  respect  to/: 
g  €  V(M)\  Lf  :  V(M)  ->  V(M), 

£/(*)  =  [/.*]= 


This  is  called  the  Lie  bracket,  and  it  is  also  denoted  by  adj-g. 

(iii)  The  Lie  derivative  of  a  one-form  w  with  respect  to/: 
w  e  V*(M);  Lf  :  V*(M) 

Lf(w)  =  d  <w,f>  =  (  ^"/)r  +  vv 

where  the  superscript  T  denotes  the  transpose  operator. 

Higher  derivatives  can  also  be  defined  by  induction  as  follows: 
(a)  Ljh  =  h,  Lxf  h  =  <d^, />,...,  L](/i  )  =  L}  (L}“  1 /1  ) 


(b )acPfg=g,  ad}g  =  ^f-^g,...,adlfg  =  [f,adlf  lg  ] 
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(c)  Lf\v  =  w,  LjW  =  d <w,/>,  ...  ,  Ljw  =  Lj(Lj-  !w  ) 

These  three  types  of  Lie  derivatives  are  related  by  the  Leibnitz 
formula: 

<Lfw,g>  =  <w,adfg>  +  Lf<w,g>  (2.1) 

Definition  2.1.2  A  set  of  C°°  vector  fields  {X1,  ...,  Xa]  on  M  is 
involutive  if  there  exist  C°°  functions  cj/(x)  such  that 

d 

[  X\  XJ]=  E  (jc)X*  (*),  1  <i,j<d\  i*  j.  (2.2) 

Jb=l 

Definition  2.1.3  A  distribution  A  on  a  manifold  M  is  a  mapping 
assigning  to  each  point  p  of  M  a  subspace  A (p)  of  the  tangent  space  Tp(M) 

to  M  at  p. 

Definition  2. 1 .4  A  distribution  is  nonsingular  on  U,  an  open  subset 
of  M,  if  there  exists  an  integer  d  such  that  dim  A  ip)  =  d  for  all  p  e  U. 


2.2  Normal  Forms  for  Linear  and  Nonlinear  Systems 

A  state  space  description  of  a  controllable  linear  system  can  be  transformed 
to  controllable  or  controller  form  by  a  linear  change  of  state  space. 
Similarly,  a  state  space  description  of  an  observable  linear  system  can  be 
transformed  to  observable  or  observer  form  by  a  linear  change  of  state 
variables.  In  the  context  of  this  section,  controllable,  observable, 
controller,  and  observer  canonical  forms  are  called  “Normal  Forms.”  The 
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definition  of  normal  forms  slightly  differ  in  literature.  We  follow  that  of 
[26]. 

In  this  section  we  will  present  normal  forms  for  linear  and  nonlinear 
systems.  This  will  be  done  in  a  systematic  way  that  is  very  suitable  for  the 
extension  of  these  concepts  to  nonlinear  systems.  The  treatment  closely 
follows  [32]. 

We  introduce  some  notations  and  definitions.  The  indices  lv  ...,/ 
are  positive  integers  summing  to  n. 

Definition  2.2.1  A  prime  triple  (A,  B,  C )  with  indices  /j,  ...,/  is  a 
triple  of  block  diagonal  matrices  of  dimension  nxn,nxq,  and  q  x  n  of  the 
form 

01  •  •  •  0 

•  « 

•  • 

i 

0  0  •  ■  •  0 


B, 

0 

0  ■ 


and 
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;  where  Ci  =  [10 


0  ]  '  x  '■  (2.3c) 


Consider  a  linear  system  described  by 


4  =  FH,  +  Gu 

y  -  Hh, 


is  said  to  be  controllable  if 


rank  { Fr  :j  =  1,  ...,  m;  r  =  1,  n}  =  n. 

The  system  (2.4)  is  observable  if 

rank  {//i-/J’r  _1:  i  -  1,  r  =  1,  =  «. 


(2.4a) 

(2.4b) 


where  Fn  xn^Gnx  an(j  Hpxn,^e  Rn,  u  e  Rm,  and  y  g  Rp.  The  system 


(2.5) 


(2.6) 


Note  that  in  Eqns.  (2.5)  and  (2.6)  while  Fr  denotes  the  r’th  power  of  F,  G ' 
and  Hi  denote  the  / th  column  of  G  and  i"th  row  of  H,  respectively. 


The  controllable  form  of  a  linear  system  is 


x  =  Ax  -  aCx  +  Bu 


y  =  yx 


(2.7a) 


(2.7b) 


where  (A,  B,  C)  is  a  prime  triple  with  indices  /,,  ...,  lm,  and  a  and  y  are 
arbitrary  matrices  of  dimensions  n  x  m  and  p  x  n.  For  example,  when 
m  =  1,  i.e.  for  a  single-input  system,  (2.7a)  takes  the  form 
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II 

-H 

1 

1 

'  ‘  •  P 

I 

O 

X  + 

"0“ 

O  *- 
1 _ 

i 

•  Q  t-H 

_ 1 

The  observable  form  of  a  linear  system  is 
x  -  Ax  -  Bax  +  Pm 


(2.8a) 


y  =  Cx  (2.8b) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  lv  ...,  lp,  and  a  and  P  are 
arbitrary  matrices  of  dimensions  pxn  and  n  x  m. 


A  system  (2.4)  can  be  transformed  into  controllable  form  (2.7)  by  a 
linear  change  of  state  coordinates  if  and  only  if  it  is  controllable. 
Similarly,  a  system  (2.4)  can  be  transformed  into  observable  form  (2.8)  by 
a  linear  change  of  state  coordinates  iff  it  is  observable. 

The  controller  form  of  a  linear  system  is 

x  =  Ax  -  Bax  +  B$u  (2.9a) 

y  =  yx  (2.9b) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  lp,  and  a,  p  and  y  are 

arbitrary  matrices  of  dimensions  mxn,mxm,pxn  except  P  must  be 
nonsingular.  Define  a  pseudo-output  y/  for  the  system  (2.4) 


yf  =  Kt 


(2.10) 


where  K  is  an  m  x  n  matrix  such  that 


KiFr~lGj  = 


0 


1  <  r  <  lj 


(2.11) 


The  observable  form  realization  of  (2.4a)  and  (2.10)  is  a  controller  form 
realization  of  (2.4).  In  other  words,  the  controller  form  (2.9)  is  actually 
the  observable  form  of  the  original  system  (2.4)  with  respect  to  the 
pseudo-output  (2.10). 


The  observer  form  of  a  linear  system  is 


x  =  Ax-clCx  +  $u  (2.12a) 

y  =  yCx  (2.12b) 


where  (A,  B,  C)  is  a  prime  triple  with  indices  lv  ...,  lp,  and  a,  p,  y  are 
arbitrary  matrices  of  dimensions  n  xp,  n  x  m  and  p  x  p  except  y  must  be 
nonsingular.  One  can  define  a  pseudo-input  p 


where  Q  is  an  n  x  p  matrix  defined  as 


HiFr-xQ>  = 


0 

isi 


1  <  r  <  /, 
r  =  /. 


(2.13) 


(2.14) 


The  controllable  form  realization  (2.12)  of  (2.4)  is  an  observer  form 
realization  of  (2.4)  with  respect  to  the  pseudo-input  p. 

Next  we  consider  a  nonlinear  system  described  by 


4  =M)  +  g&u 


(2.15a) 


y  =  h{\)  (2.15b) 

where  %  e  Rn,  u  e  Rm,  y  e  Rp,  and  /,  g,  h  are  smooth  C°°  functions. 
There  are  four  normal  forms  for  the  nonlinear  system  (2.15),  defined  as 
follows: 

Observable  Form  : 

x  =  Ax  -  Ba(x)  +  ${x)u  (2.16a) 

y  =  Cx  (2.16b) 

where  (A,  B,  C )  is  a  prime  ripie  with  indices  lv  ...,  lp,  and  a  and  P  are 
smooth  matrix  valued  functions  of  x  with  dimensions  mxl  and  nxm.  To 
illustrate  how  one  obtains  the  observable  form,  we  write  (2.16a)  in  explicit 
form: 

y  i  ~  ^i:  1 

4l  =  xi:  2  +  Pi:  lW“ 

xi:l.  =  -  a£x)  +  P ufx)u 

for  i  =  1,  ...,/?.  We  make  note  of  the  notation  in  which  we  are  using  the 
col^n  to  separate  the  first  index,  which  represents  the  component  of  the 
output,  from  the  succeeding  subscript,  which  represents  the  number  of 
derivatives  from  the  output.  Thus  xrj  is  the  (j  -  1  )-st  time  derivative  of  the 


ith  output  y(.  Since  the  output  yi  is  equal  to  in  ^  coordinates  (Eqn 
(2.15b)),  the  following  coordinate  transformation  relates  (2.15)  and  (2.16): 

xi:  l  =  h&) 
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xk2 = m 

xi:/  =  Lf'ih' m  for  j  =  1,  (2.17a) 

and 

=  Lgj(Lrfl{hi  ))(4)  for  r  =  1,  ...,  li  ;  7  =  1,  /t . 

—  oci  =  Lfi(hi  )(^)  (2.17b) 

Next  we  present  the  Controller  Form : 
i:  =  Ajc  -  #a(jt:)+  Z?P(x)w  (2.18a) 

y=y(x)  (2.18b) 


where  04,  C)  is  a  prime  triple  with  indices  /j, lm  and  a,  (3  and  y  are 
smooth  matrix  valued  functions  of  x  with  dimensions  m  x  1,  m  x  m  and  p 
x  1 .  To  obtain  the  nonlinear  controller  form,  one  chooses  a  pseudo-output 
y  =  &(%),  where  y  e  Rm  and  construct  the  observable  form  relative  to  y 

such  that  in  x  coordinates  y  =  Cx.  The  observability  indices  of  y  are 
/,, ...,  lm  and  the  coordinates  are  chosen  as  derivatives  of  this  “output”: 

xvj  =  )(4)  fory  =  1,  ...,  i  -  1,  ...,  m.  (2.19a) 


and 


P.v  =  LJL/'r'lVi  »© 

-  a,.  =  L/Kyfi  X?) 

We  note  the  similarity  of  controller  form  to  observable  form. 


(2.19b) 


Controllable  Form  : 

x  =  Ax  -  a(Cx)+  Bu  (2.20a) 

y  =  y(x)  (2.20b) 

where  (A,  B,  C )  is  a  prime  triple  with  indices  / j,  ...,  lm  and  a,  y  are 
smooth  matrix  valued  functions  of  x  with  dimensions  n  x  1  and px\.  We 
emphasize  an  important  property  of  the  nonlinear  controllable  form:  while 
a  is  a  function  of  a  pseudo-output  y  =  Cx,  the  output  y  is  a  function  of  x. 
If  a(y)  is  a  linear  function  of  y  then  the  dynamics  (2.20)  of  the  nonlinear 
controllable  form  agrees  with  the  dynamics  (2.7)  of  the  linear  controllable 
form.  Therefore  the  question  of  the  existence  of  a  nonlinear  controllable 
form  is  closely  related  to  the  question  of  linearizing  the  dynamics  (2.20a) 
by  a  coordinate  transformation. 

Observer  Form  : 


x  =  Ax  -  a(Cx)  +  $(Cx)u  (2.21a) 

y  =  y(Cx)  (2.21b) 


where  (/4,  B,  C )  is  a  prime  triple  with  indices  lv  ...,  lp  and  a,  P  and  y  are 
smooth  matrix  valued  functions  of  x  with  dimensions  n  x  1,  n  x  m  and 
p  x  1 .  To  obtain  the  observer  form,  one  defines  a  pseudo-input  <?(^)p, 


and  finds  the  controllable  form  with  respect  to  this  input.  This  explains  the 
similarity  of  controllable  form  to  observer  form. 


2.3.  The  Hunt-Su  Linearization  Theorem 

In  this  section  we  introduce  the  Hunt-Su  linearization  condition  [17,25],  and 
also  present  the  approximate  version  of  this  theorem  by  Krener  [29].  This 
question  is  equivalent  to  the  existence  of  the  nonlinear  controller  form. 
We  present  the  following  theorem  for  m  =  1. 

Theorem.  There  exists  a  change  of  coordinates  of  the  nonlinear  system 
(2.15)  to  the  controller  form  (2.18)  around  the  nominal  point  iff 

(i)  Controllability  condition:  [g(^°),  ...,  adn~j  g(4°)}  span  T^<Rn, 

(ii)  Integrability  condition:  (g(£°),  ...,  ad^2  g(^°)}  is  involutive, 

(iii)  OX  Oi)  =>  {g(4°X  a<Pfg(&°))  is  involutive. 

For  a  complete  proof,  we  refer  the  reader  to  [17,18,25].  In  the  following, 
we  present  a  systematic  method  to  find  the  change  of  coordinates  to 
transform  a  nonlinear  system  into  controller  form.  The  procedure  has 
been  directly  adopted  from  the  proof  of  the  theorem. 

Assume  a  change  of  coordinates  exist.  Define  a  pseudo-output 
\|/(4)  =  Cx(ty  and  note  that 

<dV(^°),  ad'J  g( %°>  =  (V)  =  °  n  "  <  ”  (2.22) 

-7  -/  *  [  [3  *  0  r  =  n 


In  particular  we  have,  for  d\y  *  0: 
dl|,-L  {g(Z,°),...,adnfg(%°)) 

Then,  since  for  any  two  vectors  X1,  XJ 

MV)  =  0  =>  LxiLxj(\ j/)  —  LxjLxi(\y)  =  £[x*ix-J]('K)  =  ® 

we  get  (noting  that  Lj(d\\f)  =  dLy(\y)): 

dy,  d 1  (g(4°), ...,  ad^ f3  g(£°)} 

and,  continuing  in  this  fashion,  we  obtain 
dv.  <&/¥)>  dLj(y)  1  {*(£°), ...,  ad^4  g£°)} 

dy,  dL/y),  dL^(\j/),  ...,  dL^_I(¥) 

Note  that  the  n  one-forms  d\p,  dLy(\j/),  dL^(\|/),  d£"_1(Y)  in  the  above 

are  independent,  and  as  in  (2.19a) 

xi=  if'iv) 

defines  the  coordinate  change.  Once  the  controller  form 
x  =  Ax  -  Ba(x  )+  B$(x)u  (2.18a) 

is  obtained,  the  choice  of  a  feedback 

u  =  -  a(x  )  +  v  (2.23) 


will  linearize  the  dynamics  as: 


x  =  Ax  +  B  v 


(2.24) 


where  v  is  a  new  open  loop  feedback.  Obviously,  (}(jc)  is  assumed  to  be 
nonzero. 

Krener  in  [29]  has  extended  this  proof  to  the  approximate 
linearization  of  control  systems  using  a  series  expansions  of  the  nonlinear 
terms.  Following  [29],  we  first  introduce  some  definitions  as  follows: 

A  distribution  A  has  an  order  p  local  basis  around  5°  if  there  exist 
vector  fields  X1,  ...,  Xd  which  are  linearly  independent  at  5° and  such  that 
for  every  Ye  A  there  exist  functions  ck  such  that 

d 

Y=  £  ck  Xk  +  0(5  -  5°)p  +  1  (2.25) 

/b=l 

The  integer  d  is  the  order  p  dimension  of  A  at  £°.  Such  a  distribution  is 
said  to  be  order  p  involutive  at  5°  if  there  exist  functions  cjf  such  that: 
d 

[  X\  Xj]=  £  cjj  Xk  +  0(5  -  5°)p.  (2.26) 

k  =  1 

Such  a  distribution  is  said  to  be  order  p  integrable  at  5°  if  there  exist  n-d 
independent  functions  hd+v  ...,  hn  such  that 

<dhi  ,Xj>  =  0(5)p.  (2.27) 

Theorem.  (Frobenius  with  remainder)  (Krener)  Let  A  be  a  distribution 
with  order  p  basis  {  X1,  ...,  Xd  }  at  5°-  A  is  order  p  integrable  at  5°  iff  A 
is  order  p  involutive  at  5°- 


A  proof  of  the  above  theorem  can  be  found  in  [Krener 1984a].  Given  the 
nonlinear  system  (2.15)  we  define  distributions 

A*  =  C°°  span  { adj-  g1 :0  <  /  <  k\  j  =  1,  ...,  m } . 

Now  we  state  the  central  result  of  [29]: 

Theorem.  The  nonlinear  system  (2.15)  can  be  transformed  with  a 
coordinate  change 

x=x{^)  (2.28a) 

andfeedback 

u  =  n(£,v)  =  a(4)  +  P£)v  (2.28b) 

into  the  order  p  linear  system 

x  =  Ax  +  Bv  +0(jc,v)p  +  1  (2.29) 

where  (A,  B)  is  a  controllable  pair  with  controllability  indices  lx  >  ...  >  ln 

iff 

(i)  A*  has  an  order  p  local  basis  at  hf  consisting  of 
{adj  gi:  0  <  /  <  min(A:-,  k)\  j  =  1,  ...,  m) . 

(ii)  A kFx  is  order  p  involutive  at  hf  for  j  -  1,  ...,  m. 

The  proof  for  the  general  case  as  stated  in  the  theorem  can  be  found  in 
[29].  Here  we  will  present  a  simplified  proof  for  m  -  1.  First,  we  restate 
the  theorem  for  m  =  1 : 


Theorem.  The  nonlinear  system  (2.15)  with  a  single  input  u  can  be 
transformed  into  the  order  p  linear  system 

x  =  Ax  +  Bv  +0(x,v)p  +  1  (2.30) 

where  (A,  B)  is  a  controllable  pair  iff 

(i)  An  =  C°°  span  (g(^°),  ...»  ad”f*  g(£,°)}  has  an  order  p  local  basis  at  hf 
consisting  of 

(ii)  A"-1  =  C°°  span  (g(4°),  . . . ,  ad^1  g(§°))  is  order  p  involutive. 

Note  the  similarity  of  the  above  statement  to  the  Hunt-Su  linearization 
conditions. 

Proof.  Assume  the  change  of  coordinates  and  feedback  (2.28)  exist.  Let 
/(£,)  and  g®  denote  the  transforms  of  Ax  and  B  into  £,  coordinates.  It  is 
straightforward  to  verify  that  the  distribution 

An  =  C°°  span  {  g®,  ...,  ad_nf-~l  g® } 

satisfies  (i)  and  (ii)  with  phrase  “order  p”  deleted.  Moreover  from  the 
form  of  (2.28)  one  can  verify  that  A"  and  A"  agree  to  order  p  at  ^°,  i.e. 
any  vector  field  of  one  agrees  with  a  vector  field  of  the  other  to  order  p. 
Hence  (i)  and  (ii)  follow. 

On  the  other  hand  suppose  (i)  and  (ii)  hold.  By  the  controllability 
assumption  A”  is  of  codimension  zero.  An  " 1  is  of  codimension  one  and  by 
(ii)  is  order  p  involutive.  Therefore  we  can  find  a  scalar  function  hft) 
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which  will  annihilate  it  to  order  p  as  in  (2.27).  This  function  and  its  Lie 
derivatives  define  the  desired  linearizing  coordinates  (2.28a) 

x~  Llf\h) ;  i  =  1,  ...,  n. 

In  these  coordinates  the  nonlinear  system  (2.15)  becomes 
' xi+  j  +  0(£,v)p+1  if  i  <  n, 

=  1  (2-31) 

iL^h)  +  LgLlf\h)v  +  0(£,v)p+1  if  i  =  n. 


The  linearizing  feedback  (2.28b)  is  given  by 
u  =  L‘/h)  +  LgL‘fl(h)v. 


(2.32) 
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3.  HIGHER  DEGREE  LINEAR  APPROXIMATIONS  OF 
NONLINEAR  CONTROL  SYSTEMS 


3.1.  Introduction 

In  Chapter  2,  we  introduced  normal  forms,  the  Hunt-Su  linearization 
results,  and  Krener's  extension  of  the  Hunt-Su  method  by  approximate 
linearization  (via  the  series  expansion  of  the  vector  field  around  a  nominal 
point).  In  this  chapter,  the  normal  form  theorem  of  Poincare  will  be 
introduced  first.  This  is  an  approximate  linearization  of  an  autonomous 
vector  field  by  a  nonlinear  (local)  change  of  coordinates.  Next,  we  will 
present  the  approximate  linearization  problem  that  will  be  investigated  in 
this  report,  which  is  formulated  in  a  very  similar  spirit. 

The  fundamental  difference  between  the  Poincare  linearization  and 
Krener's  approximate  linearization  method  needs  some  emphasis.  As  will 
be  discussed  in  the  next  section,  a  sufficient  condition  for  a  linearizing 
transformation  to  exist  for  an  autonomous  nonlinear  system  is  the  so  called 
“resonance  condition”  (see  Sec.  3.2  of  this  chapter)  for  the  eigenvalues  of 
the  linear  part,  or  the  Jacobian,  of  the  vector  field  at  the  nominal  point.  In 
contrast,  one  of  the  necessary  conditions  for  finding  a  coordinate  transform 
and  feedback  pair  that  linearizes  a  nonlinear  control  system  exactly  (the 
Hunt-Su  linearization)  or  approximately  (Krener's  linearization)  is  that  the 
system  be  locally  controllable.  Obviously,  this  requirement  implies  that 
one  is  able  to  freely  assign  the  eigenvalues  of  the  linear  part  of  the  vector 


field.  Thus,  unlike  the  Poincare  problem,  a  resonance  condition  for  the 
eigenvalues  does  not  exist  for  this  case.  Basically,  one  has  the  capability  to 
change  the  flow  of  the  vector  field  through  the  input  (though  only  locally 
within  the  framework  we  presented;  global  controllability  of  a  nonlinear 
system  has  much  more  stringent  requirements).  The  fundamental 
difference,  is  that,  while  in  Poincare's  problem  one  only  wants  to  be  able  to 
understand  and  predict  the  behavior  of  the  flow  of  a  vector  field  by  means 
of  a  closer  approximation  through  coordinate  transformations,  in  the 
problem  of  linearization  by  feedback  and  coordinate  change  the  goal  is  a 
much  more  ambitious  one:  We  first  seek  to  linearize  the  nonlinear  system 
and  ultimately  c  mtrol  its  behavior.  Reminiscent  of  the  connection 
between  the  ♦  ,o  problems,  however,  we  continue  to  use  the  term 
“Homological  Equations”  (after  Arnold  [4])  for  the  set  of  equations  that  we 
will  develop  in  the  solution  to  the  linearization  problem. 

There  are  cases  in  which  the  two  approaches  might  in  fact  be  used 
together.  When  a  nonlinear  system  has  both  controllable  and 
uncontrollable  modes,  one  can  decouple  the  state  space  locally  into 
controllable  and  uncontrollable  submanifolds  by  some  suitable 
transformation.  In  the  controllable  submanifold,  the  approximate 
linearization  problem  may  be  solved  based  on  the  results  presented  here. 
On  the  other  hand,  one  can  decompose  the  uncontrollable  submanifold  into 
stable,  critical,  and  unstable  parts.  An  uncontrollable-unstable  mode  is,  of 
course,  beyond  help.  One  does  not  need  to  worry  about  the 
uncontrollable-stable  modes,  and  can  only  hope  they  will  decay  sufficiently 
fast.  The  uncontrollable-critical  modes  (in  which  the  eigenvalues  are  on 
the  imaginary  axis),  however,  can  be  analyzed  using  Poincare's  method.  In 


fact,  in  the  framework  of  the  analysis  of  nonlinear  oscillations  and 
bifurcations  [14]  the  purpose  of  the  “Center  Manifold  Theorem”  is 
precisely  that.  In  control  systems  literature  there  are  some  fine  examples 
of  work  towards  controlling  the  stability  of  uncontrollable-critical  modes 
of  a  control  system  by  means  of  the  Center  Manifold  Theorem  and 
bifurcation  analysis  (also  called  “Bifurcation  Control”)  [1,2].  Although  this 
problem  may  be  formulated  as  an  extension  of  our  work,  this  topic  is 
beyond  the  scope  of  this  report. 


3.2.  Higher  Degree  Approximations  of  Autonomous  Systems 


Let  us  consider  an  autonomous  system: 

x  =  fix)  (3.1a) 

*(0)  =  jc°.  (3.1b) 

where  x  e  and  the  system  is  assumed  to  be  at  rest  at  the  origin,  i.e. 
/( 0)  =  0.  Without  loss  of  generality  we  will  assume  x°  =  0.  The 
calculations  can  be  easily  repeated  for  x°  *  0.  First,  consider  the 
linearization  of  (3.1)  at  xc: 


x  =  F  x  (3.2a) 

F  =  g(0)  (3.2b) 


We  will  seek  a  coordinate  change  for  (3.1)  of  the  form  identity  plus  higher 
degree  terms,  such  that  the  resulting  system  will  agree  with  (3.1)  up  to  an 
error  of  degree  0(x)p+1  where  p  is  the  degree  of  approximation. 


Obviously,  we  obtain  Eqn.  (3.2)  when  p  =  1.  We  will  now  derive  the  case 
for  p  =  2  and  the  results  will  be  generalized  to  any  arbitrary  degree  p  by 
induction. 


We  assume  a  transformation  of  the  form: 

z  -  x-  <j>^(x)  (3.3) 

where  z  denotes  a  new  set  of  coordinates.  <j)(2\jc)  is  a  polynomial  of  degree 
2,  the  monomial  coefficients  of  which  are  to  be  found.  The  function /(*)  is 
expanded  in  a  series: 

fix)  =/(1)0t)  +f{2\x)  +  0(jc)3 

=  Fx  +f(~2\x)  +  0(x)3  (3.4) 

The  goal  of  the  transformation  (3.3)  is  to  choose  <j>(2)(*)  such  that  in  z 
coordinates  the  dynamics  of  the  system  is  represented  by 

z-Fz  +  0(x)3  (3.5) 


in  other  words,  the  second  degree  terms  in  the  series  expansion  (3.4) 
vanish  under  the  coordinate  change.  We  take  the  time  derivative  of  (3.3): 


Using  (3.1a),  (3.3),  (3.4)  and  (3.5)  evaluate  each  side  in  the  above: 

F(x  -  e\x))  =  Fx  +/(2)( x)  +  OCx)3  (3.6) 

After  some  cancellation,  ignoring  0(jc)3  and  higher  terms,  and  using  the 
Lie  bracket  notation  we  obtain 


/(2)Cr)  =  [FxM2\x)} 


(3.7) 
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Equation  (3.7)  is  called  the  Homological  Equation  [4].  A  similar  derivation 
can  also  be  found  in  [14].  In  Eqn.  (3.7),  /(2)( x )  and  2\x )  are  n- 
dimensional  functions  of  homogeneous  polynomials  of  degree  2.  The  Lie 
bracket  operation  defines  a  mapping 

[Fx,  ■ } :  <t>^(x)  ->  [Fx,(j>(2>(;t)]  (3.8) 

Obviously,  (3.8)  represents  a  linear  mapping  from  n  (n  +  l)/2  dimensional 
parameter  space  of  the  coefficients  of  <J>^(*)  to  an  n2(n  +  l)/2  dimensional 
parameter  space  that  is  the  result  of  the  Lie  bracket  operation.  The 
question  is  whether  f^2\x)  in  the  range  of  this  mapping.  In  other  words, 
is  it  always  possible  to  find  <j>(2)0c)  that  will  satisfy  (3.7)?  This  problem 
was  first  solved  by  Poincar6.  In  the  following,  we  present  a  slightly 
modified  proof  [4,14]: 

Suppose  F  has  a  full  set  of  linearly  independent  eigenvectors.  Then 
we  can  take  these  eigenvectors  as  a  basis,  which  are  defined  by 

Fvk  =  Xkvk  (3.9) 

where  v*  e  Cn  x  1  and  \  e  C.  Similarly  there  exists  a  basis  of  eigenvectors 
of  Ft  defined  by 

wiF  =  Xiwi  (3.10) 

where  wi  e  Clxn.  We  define  a  basis  for  n-dimensional  functions  of 
homogeneous  polynomials  of  degree  2  as  follows: 

(p*y(jc)  =  vk(wix)(wjx)  1  <  i  <  j  <  n  ;  1  <  k  <  n.  (3.1 1) 


Then,  using  this  basis  to  express  the  degree  2  polynomials  in  Eqn.  (3.7),  we 
evaluate  the  Lie  bracket 

,  3  (pic*)  i 

=  ar1?*  -f<PyW 

=  vk(iwix)wj  +  (wjX)wi)Fx  -  Fvk(wix)(WjX) 
and  introducing  (3.9)  and  (3.10)  we  obtain 
!Fz,<p^(.t)]  =  vt(Xj(wix)(w/x)  +  X^WjXXwjx)  -  Xk(wfc)(wjx)) 

=  at  +  x,  -  x^lM)  (3-i2) 

So  the  mapping  (3.8)  is  onto  if  (ki  +  -  Xk)  *  0  for  all  j,  k  =  1,  ...,  n; 

i  -  1,  In  the  literature,  this  is  called  the  resonance  condition.  We 

note  that  this  is  only  a  sufficient  condition.  A  generalization  of  the  proof 
for  the  case  when  F  does  not  have  a  full  set  of  independent  eigenvectors 
may  be  found  in  [4]. 

The  above  proof  can  easily  be  extended  to  an  arbitrary  degree  of 
linearization  as  follows.  Given  an  autonomous  system  that  has  degree  p 
and  higher  nonlinear  terms 

x  =  Fx  +/(p)(jc)  (3.13) 

one  seeks  a  coordinate  transformation  of  the  form 

z  =  x  -  <j>(p)(jc)  (3.14) 

(3.15) 


with  the  goal  to  obtain 
z=Fz  +  0(z)p+1. 


This  leads  to  a  new  homological  equation 
[Fxtfp\x)]  =/(p)  (x). 


(3.16) 
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With  the  assumption  that  the  matrix  F  has  n  linearly  independent 
eigenvectors,  and  using  Eqns.  (3.9)  and  (3.10)  we  choose  a  basis  for  n- 
dimensional  functions  of  homogeneous  polynomials  of  degree  p  as 

9*.  ...,;p(z)  =  Aw,  z)...(wlpz)  for  1  <  ix  <...<  ip  ^  n  ;  1  <  k  <  n.  (3.17) 


Then  a  similar  calculation  yields 


...iWl  =  ft,,  +  •  •  •  +  V  -  ^)<p* . ,„(*)• 


(3.18) 


From  the  above  we  conclude  that  the  condition  of  no  resonance  requires 
that  (A,fj  +  •  •  •  +  A,Ip  -  A,^)  *  0. 


3.3.  Higher  Degree  Approximations  of  Control  Systems 

Generally  speaking,  there  are  two  different  goals  for  linearizing  a 
nonlinear  system.  One  may  seek  a  linearization  for  the  purpose  of 
designing  a  control,  or  alternatively  the  linearization  may  be  tailored  for 
the  purposes  of  estimation.  In  this  section  we  will  attempt  to  find  a 
solution  for  the  problem  of  linearization  for  control.  This,  of  course, 
assumes  full  state  observability.  Let  us  consider  a  nonlinear  system  in 
which  the  control  u  enters  the  dynamics  in  a  linear  fashion: 

x  =  f(x)  +  g{x)u  (3.19a) 

(3.19b) 


jc(0)  -  x° . 


where  x  e  R"  and  u  e  Rm.  The  system  is  assumed  to  be  at  rest  at  the 
nominal  operating  point  (jc°;  u  =  0).  Again,  we  will  assume  x°  =  0. 
First,  consider  the  linearization  of  (3.19)  at  xc: 
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x  =  Fx  +  Gu  (j.20a) 

F  =  |£(0),  G  =  «(0).  (3.20b) 

We  will  seek  a  coordinate  change  for  (3.19)  of  the  form  identity  plus 
higher  degree  terms,  such  that  the  resulting  linear  plant  will  agree  with 
(3.19)  up  to  an  error  of  degree  0(x,  m)p+1  (i.e.  terms  of  0(x)p+1and  OQt, 
u)p )  where  p  is  the  degree  of  approximation.  When  p  =  1,  the  first 
degree  approximation  (3.20)  is  obtained.  Similar  to  the  previous  section, 
the  case  for  p  =  2  will  be  derived  first,  and  the  results  will  be  generalized 
to  any  arbitrary  degree  p  by  induction.  Before  proceeding  further,  the 
nonlinear  functions /and  g  are  expanded  in  a  series: 

f(x)  =  fm(x)  +/(2)  (x)  +  0(;c)3 

=  Fx  +fm(x)  +  0(x)0>  (3.21) 

g(x)  =  g  m(x)  +  g  <>>(*)  +  OW2 

=  G  +  g  Hl(x)  +  O(x)2  (3.22) 

and  the  nonlinear  system  (3.19a)  is  rewritten  as 

x  =  Fx  +f(2\x )  +  (G  +  g  {l\x))u  +  0(x,u)3  (3.23) 

We  assume  the  same  transformation  proposed  in  Sec.  3.1: 
z  =  x  -  tP-Xx) 


(3.3) 


where  z  denotes  the  new  set  of  coordinates.  <j>(2)(x)  is  a  polynomial  of 
degree  2.  In  addition,  a  new  input,  denoted  as  v,  is  chosen  as 
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v  =  a(.r)  +  $(x)u  (3.24a) 

where  a(x)  =  a(2)(jc),  an  n  x  1  vector  of  second  degree  polynomials,  and 
p(x)  =  I  +  P(1)(jc),  an  identity  matrix  plus  first  degree  terms,  both  of 
dimension  mxm.  So  (3.24a)  becomes: 

v  =  a(2)0c)  +  (I  +  P(1)(jc))w.  (3.24b) 

We  note  the  slightly  different  form  of  Eqn.  (3.24);  in  the  literature  (as  well 
as  in  the  treatment  presented  in  Chapter  2  of  this  report),  the  feedback  that 
accompanies  a  coordinate  change  for  linearization  problems  is  usually 
given  as  u  =  a(x)  +  p(x)v.  One  can  always  obtain  one  expression  from  the 
other,  since  by  definition  p(x)  is  nonsingular.  The  above  choice  simplifies 
the  algebra,  as  we  will  see  in  the  following  derivations.  Now  we  want  the 
system  to  become,  in  z  coordinates, 

z  =  Fz  +  Gv  +  0(z,v)3  (3.25) 


The  time  derivative  of  (3.3)  yields: 

_  .  8<j>(2)to  . 

z"'t~  dx  x 


(3.26) 


We  introduce  the  transformation  (3.3),  and  Eqns.  (3.23),  (3.24b)  and 
(3.27)  into  the  above: 

F(x  -  <t>!2,W)  +  G  (<x<2)(a:)  +  (I  +  p(l)W)u) 

=  0  -  ^  XFx  +fm(x)  +  Gu  +  g  (,\x)u  ) 
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Using  (3.28),  Eqn.  (3.27)  can  be  written  as 

/(2)( x)  +  g  (1)(j x)u  =  [F;c,<j)(2)(;t)]  +  Ga(2)(r)  +  [Gu,<$>{2\x)]  +  Gp(1)(jt)w 

(3.29) 

or,  since  terms  of  0(jc)2  are  independent  of  terms  of  O (x,u)2 

/(2)( x)  =  [Fjc,<1>(2)(;c)]  +  Ga{2\x)  (3.30a) 

g  (1)(jc)m  =  [Gh,<]/2^(jc)]  +  Gp^(jc)«  V  constant  u.  (3.30b) 

Because  of  its  similarity  to  the  homological  equations  derived  in  the 
previous  section,  we  call  (3.30)  the  Generalized  Homological  Equations. 


36 


In  order  to  find  an  approximation  of  the  next  higher  degree,  we 
rewrite  (3.25)  by  reverting  to  the  variables  x  and  u  for  convenience: 

x  =  Fx  +  Gu  +  0(jc,m)3  (3.31) 

At  this  point,  we  assume  that  the  second  degree  terms  in  a  given  nonlinear 
system,  if  any,  have  been  already  removed  as  outlined  above.  Then  we 
seek  a  new  transformation  of  the  form: 

z  =  x-<b(3\x)  '  (3.32) 

Note  that  except  for  the  linear  part,  transformation  (3.32)  will  not 
introduce  any  terms  of  degree  less  than  3.  Then  the  same  procedure 
outlined  above  is  repeated,  with  the  feedback: 

v  =  «(3)(jc)  +  (I  +  P(2)W)w  (3.33) 

which,  after  a  series  of  similar  calculations,  results  in  a  new  set  of 
generalized  homological  equations: 

f°\x)  =  [Fjt,<j)(3)(;c)]  +  Ga°\x)  (3.34a) 

g  (2)(x)m  =  [G«,<j>(3)(;c)]  +  Gp(2)(jc)u  V  constant  u.  (3.34b) 

These  results  can  be  generalized  as  follows.  Given  a  system  which  is 
accurate  to  only  degree  p  -  1,  i.  e. 

x  =  Fx  +  Gu  +  0(jc  ,w)p  (3.35) 


a  coordinate  change  is  sought  as: 
z  =  x  -  <})^(jc) 


,(3.36) 
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along  with  feedback: 

v  =  a(p)U)  +  (I  +  ^-l\x))u  (3.37) 

which  yields  the  generalized  homological  equations  to  be  solved: 

f(p\x )  =  [F;t, <1>(p)(jc)]  +  Ga(p\x)  (3.38a) 

g  (p_1)(x)m  =  [Gu,<}>(p)(;c)]  +  G$ip~l\x)u  V  constant  u.  (3.38b) 

In  (3.38),  <j>(p),/^\  cfo\  g  ^-1)and  |^p-1)  are,  respectively,  homogeneous 
vector  fields  of  degrees  corresponding  to  their  superscripts.  The  resulting 
system  is  accurate  up  to  degree  p: 

z  =  Fz  +  Gv  +  0(z,v)p+1  (3.39) 

Once  a  higher  degree  linear  approximation  is  obtained  for  a 
nonlinear  system  one  of  the  important  issues  is  the  stability  of  the  closed 
loop  system.  Thus  one  may  choose,  for  instance,  a  linear  state  feedback  for 
the  approximate  model 

z  =  Fz  +  Gv  (3.40) 

by  setting  v  =  Kz.  The  gain  matrix  K  is  chosen  such  that  in  the  closed  loop 
the  system  gives  the  desired  performance.  If  we  assume  that  the  model  has 
been  linearized  up  to  second  degree,  the  feedback  v  is,  from  Eqn.  (3.24b) 

v  =  <x(2)(jc)  +  (I  +  P(1)(jc))m.  (3.41) 

Using  Eqn.  (3.41),  the  feedback  v  =  Kz,  and  transformation  (3.3)  we 
calculate  the  feedback  law  u : 


Kx  -  K^2\x)  =  a(2)(;c)  +  (I  +  P(1)C*))m 


(3.42) 
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and 

u  =  ( I  +  P(1)(jc))“1(ATjc  -  K^2\x)  -  a(2\x)) 

=  (I  -  p(I)(jc)  +  -)(Kx  -  K^2\x)  -  a(2)(jc>) 

=  Kx  -  { ${l\x)Kx  +  Kty(2\x)  +  a(2)Oc)}  +  0(*,w)3.  (3.43) 

From  (3.43),  the  purpose  of  the  feedback  u  becomes  immediately  clear.  In 
addition  to  a  linear  feedback,  there  are  second  degree  correction  terms 
(placed  inside  curly  brackets  in  (3.43)  for  emphasis).  While  one  chooses  a 
closed  loop  feedback  u  -  Kx  to  achieve  stability,  pole  placement,  etc.  for 
the  first  degree  approximation  (3.20a)  (i.e.  accurate  up  to  first  degree  in 
comparison  with  a  linear  model)  to  get 

x  =  (F  +  GK)x  (3.44) 

the  feedback  (3.43)  introduces  certain  second  degree  terms  and  achieves  a 
second  degree  approximation  (i.e.  accurate  up  to  second  degree  in 
comparison  with  a  linear  model  in  the  z  coordinates)  toward  the  same 
feedback  design  goals: 

x  =  (F  +  GK)x  +/(2)(j c)  +  g  (l\x)Kx  -  G  {  P(1)(jc)/^jc  +  AT< >(2)(jc) 

+  a(2\x)}  +  O  {x,uf.  (3.45) 

One  important  feature  of  the  feedback  (3.43)  and  the  resulting  closed  loop 
system  (3.45)  is  that  one  need  not  transform  the  state  variables  into  the  new 
coordinates  z  that  were  introduced  for  the  sake  of  calculations.  Feedback 
design  can  be  performed  in  the  natural  coordinates  in  which  the  system  is 
originally  presented.  Obviously,  the  above  scheme  assumes  a  priori -that  all 
states  are  available  for  feedback.  If  some  of  the  states  are  not  observable, 


one  can  estimate  the  unavailable  state  variables  by  means  of  an  observer, 
and  apply  the  same  procedure.  This  case  will  be  treated  in  Section  3.5  of 
this  chapter. 


39 


3.4.  Approximate  Linearization  for  Systems  with  Small 
Parameters 

In  this  section,  we  consider  a  control  system  of  the  form: 

x  =/(*,£)  +  g(x,e)u  (3.46a) 

jc(0)  =  x°.  (3.46b) 

where  e  is  a  small  parameter  which  characterizes  the  way  parasitic  effects 
or  disturbances  enter  into  the  system.  We  will  develop  a  method  of 
linearizing  transformation  for  this  type  of  system,  similar  to  that  of  Section 
3.2.  First,  (3.46)  is  expanded  as  follows: 

x  =  Fx  +  Gu  +  e(fil\x)  +  g  w(x)u)  +  0(e)2  (3.47) 

In  (3.47),  the  nonlinear  function  is  expanded  and  grouped  in  powers  of  e. 
Thus,  the  superscripts  of/ and  g  now  represent  the  powers  of  e  multiplying 
these  functions.  Note  that  this  notation  is  different  than  that  of  Section  3.2. 
A  coordinate  change  is  assumed  of  the  following  form: 

z  =  x  -  £<|)(1)(jc)  (3.48) 

where  neither  the  coefficients,  nor  the  polynomial  degree  of  the  function 
<}>(1)(;t)  is  yet  determined  (i.e.  the  superscript  in  this  context  denotes  degrees 


in  e).  After  the  transformation  and  feedback,  we  want  the  system  to 
become 
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k  =  Fz  +  Gv  +  0(e)2  (3.49) 

where 

v  =  u  +  e{<x(1)(x)  +  P(1)(jc)«}  (3.50) 

Note  that  the  superscripts  of  a  and  P  in  (3.49),  as  well  as  the  superscript  of 
<j>  in  (3.48)  represent  the  power  of  e  these  terms  are  multiplied  with.  Thus, 
the  notation  used  is  different  than  that  of  Sections  3.2  and  3.3.  Repeating 
the  calculations  similar  to  Section  3.3  yields  the  homological  equations: 


/(1)( x)  =  [JPjc,<h(1>(jc)]  +  Ga(1)(x)  (3.51a) 

g  w(x)u  =  [Gu,$w(x)]  +  G$l\x)u  V  constant  u.  (3.51b) 

This  result  can  be  generalized  for  an  arbitrary  power  of  e  in  the  same 
fashion:  A  solution  to 

/<p,(j :)  =  +  Gaw(x)  (3.52a) 

g  =  |Gu,4)(p)(j:)]  +  G$((>>(x)u  V  constant  u.  (3.52b) 


will  yield  a  coordinate  transform-feedback  pair  that  will  transform  the 
system  into: 

i  =  Fz  +  Gv  +  0(e)p+1  (3.53) 

Even  though  Eqns.  (3.52)  and  (3.38)  look  very  similar,  there  are  some 
fundamental  differences.  All  the  variables  in  Eqn.  (3.52)  have  different 
definitions  than  those  of  Eqn.  (3.38),  as  mentioned  earlier  in  this  section. 
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Moreover,  the  solvability  conditions  of  (3.52)  are  not  the  same  as  the 
conditions  of  Eqn.  (3.38).  Actually,  (3.52)  may  represent  an  infinite 
family  of  equations  as  opposed  to  the  finite  dimensional  set  of  expressions 
that  arise  from  (3.38). 

Any  nonlinear  system  expressed  in  the  form  of  in  Eqn.  (3.19)  can 
always  be  transformed  into  the  form  of  (3.46)  (and  vice  versa)  as  follows: 
First,  consider  the  expanded  form  of  (3.19),  i.e.  Eqn.  (3.23): 

x  =  Fx  +  fV\x)  +  (G  +  g  (l\x))u  +  0(x,m)3  (3.23) 

Scale  the  coordinates  and  the  input  with: 

C  =  e'1* 

P  =  £_1M 

Introducing  the  above  into  (3.23)  yields 

t  =  FC  +  Gp  +  e(/  (2)(0  +  g  (1)(C¥)  +  0(E)2  (3.54) 

This  equation  is  of  the  form  of  Eqn.  (3.47),  except  for  the  difference  in  the 
fashion  the  expansions  of /and  g  are  defined.  We  use  the  overbar  notation 
to  emphasize  this  point.  The  input 

v  =  a(2)(x)  +  (I  +  $(l\x))u  (3.24) 


is  also  transformed  with  an  additional  scaling  q  =  £  V 
q  =  p  +  £(a(2)(0  +  P(1)(Qp) 


(3.55) 


With  the  above  scaling  of  coordinates,  a  linearization  problem  given  as  in 
Section  3.2  can  be  alternatively  solved  with  the  procedure  outlined  in  this 
section. 


3.5.  Analysis  of  the  Linear  Mapping  for  Linearization  for 
Control 


For  the  case  of  linearization  for  control,  we  derived  the  generalized 
homological  equations  (3.30)  in  Section  3.3.  In  these  equations,  the  second 
degree  terms  /(2)( jc)  and  g  (1)(jc )u  can  be  cancelled  by  proper  choice  of 
<j>(2)C*:),  a(2)(jc),  and  p(1)(x)  under  certain  solvability  conditions.  When  the 
coefficients  of  the  like  terms  in  (3.30)  are  set  equal,  a  linear  mapping  is 
obtained  as 


a(2Yx) 

ip(1)WJ 


W)J 


(3.56) 


A  simple  dimension  count  yields  the  dimensions  of  the  domain  and  the 
range: 


n2(n_+  1)  mn(n+  1)  2 

2  +  2  +  tn  n 


n2(n  +1)  9 

-*  2  +" 


m 


(3.57) 


To  analyze  the  mapping,  we  first  make  a  table  for  the  dimensions  of  the 
domain  and  the  range,  where  n  is  the  dimension  of  the  state  space  and  m  is 
the  dimension  of  the  input  space: 
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For  m  =  1: 

Dimension 

of 

State  Soace 

Dimensions 

of 

Domain  Ranee 

n  =  2 

11 

10 

n  =  3 

27 

27 

n  =  4 

54 

56 

For  m  =  2: 

Dimension  Dimensions 


of  of 


State  Soace 

Domain  I 

n  =  2 

20 

14 

n  =  3 

42 

36 

n  =  4 

76 

72 

n  =  5 

125 

125 

Dimensions  of  the  domain  and  the  range  become  equal  whenever 
n  =  2m  +1.  However,  this  does  not  imply  that  the  mapping  is  of  full 
rank.  For  example,  when  m  =  1,  n  =  3  the  rank  is  26,  not  27.  In  general, 
when  m  =  1,  the  rank  of  the  mapping  is  always  one  less  than  the  dimension 
of  the  domain  for  n  >  3. 

As  described  in  Chapter  2,  a  necessary  condition  for  finding  a 
coordinate  change-feedback  pair  for  a  nonlinear  control  system  is  the  local 
controllability  condition  at  the  nominal  point.  Thus,  for  the  system  (3.23) 
with  a  scalar  input  u  ,  i.e.  m  =  1,  local  controllability  implies 

rank  [G  FG  .  .  .  Fn~lG)  =  n.  (3.58) 

On  the  other  hand,  we  define  a  1  x  n  matrix  K  such  that 
f  0  1  <  i  <  n 

KFl  =  _  (3.59) 

11  i  =  n 


Then, 

rank  [K  KF  .  .  .  KFn~x)  =  n. 


(3.60) 


(3.58)  and  (3.60)  together  imply  that  we  can  define  a  basis  for  n- 
dimensional  second  and  first  degree  monomials  as  follow*  Fir0i  define  as 
a  basis 

vk  =  Fk~lG  (3.61a) 

and  a  co-basis 

wi  =  KFi~l  (3.61b) 

Then  we  define  a  basis  for  second  degree  monomials  as 
9;,0 0  =  vk(wtx)(wjx)  for  j\  k  =  l,  ...,  n  ;  i  =  1,  ...,y.  (3.62) 

and  a  basis  for  first  degree  monomials  as 

9/to  =  vk(wpc)  for  k  —  1,  ...»  n  \  i  =  1,  n.  (3.63) 

Using  the  basis  definitions  (3.62)  and  (3.63)  is  a  great  convenience  for 
calculating  the  Lie  bracket  expressions  that  appear  in  the  generalized 
homological  equations  (3.30).  Calculation  of  (3.30a)  gives 

tf+U  +  <P*v+i  -  9*3*  1  <  i  <  j  <n  ;  1  <  k  <  n 

[Fx,<pkj(x)  ]  =  <  9i+i  j  ~  9  y 1  1  <  i  <  j  =n  ;  1  <  k  <  n 

i  -j  -n  ;  1  <  k  <  n 

(3.64) 


In  the  evaluation  of  (3.64),  when  k  =  n,  the  expressions  become  slightly 
more  complicated.  For  multi-input  problems,  the  above  calculations 
become  even  more  involved.  In  the  following  chapters,  we  will  present  a 
further  simplified  way  to  calculate  the  bracket  in  the  most  general  case  that 
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is  much  more  suitable  for  numerical  implementation.  Next,  we  calculate 
(3.30b) 

'0  i,  j  <  n 

[GtfipO  ]  =  ‘  i  <  j  —  n  (3.65) 

Ml  i  -  j  =  n 


We  can  use  these  two  formulas  to  compute  the  kernel  and  co-kemel  of  the 
mapping 


V2)w' 

'  a(2)(x)  • 


/<2,«  1 

(1)wJ 


(3.56) 


such  that,  we  now  obtain  a  set  of  linear  equations  expressed  in  matrix 
form: 


L 


(3.66) 


9  9 

In  (3.66),  L  is  a  constant  coefficient  matrix  of  n  (n+ 1)/2  +  n  rows  by 

n2(n+ 1)/2  +  n(n+ 1)/2  +  n  columns  that  is  found  from  the  above  evaluation 

4>(2)  1-/(2)- 

of  the  Lie  brackets  of  the  mapping.  a(2)  and  are  the  constant 

Lpo)j  u  (1)j 


coefficients  of  their  corresponding  terms,  stacked  in  a  consistent 
lexicographic  ordering.  For  the  single  input  linearization  problem,  the 
column  rank  of  L  is  (n2(n+ 1)/2  +  n(n+ 1)/2  +  n  -  1). 


A  solution  to  the  linearization  problem  is  developed  as  follows. 
First,  we  note  that  since  the  mapping  (3.66)  is  deficient  in  rank  for  n  >  2,  a 
given  control  system  with  nonlinear  terms  /(2)(x)  and  g  (1)(jc)m  will  not,  in 


general,  have  an  exact  solution  to  yield  a  second  degree  linearization.  In 
fact,  the  Hunt-Su  linearization  result  [17]  (or  Kroner's  extension  of  the 
same  to  the  approximate  linearization  case  in  [29])  is  a  test  for  precisely 
this  condition.  Consequently,  Eqn.  (3.66)  will  not  usually  have  an  exact 
solution  for  n  >  2.  Then,  it  is  reasonable  to  seek  an  approximate  solution 
which  will  minimize  the  error  in  the  linearization  with  respect  to  some 
norm.  In  order  to  give  a  precise  meaning  to  this  problem,  first  assume 
that  we  have  adequate  knowledge  about  the  operating  regime  of  the  control 
system  and  the  desired  accuracy  as  determined  by 

p  (x,u)  :  A  probability  density  function;  typically  uniform  over  some 
compact  set,  or  Gaussian. 

Q  :  A  sensitivity  matrix,  positive  definite. 


Then  define  the  “error” 


( f{2) 'S 


\B 


(i) 


(J  <2A 
U(1)J 


=f  (2)— /  (2)  +  ( 8  (I)-  g(1))u  I Q  p(x,u)dx  d u 


(3.67) 

2  T 

In  Eqn.  (3.67)  the  norm  be  I  is  defined  as  x  Qx.  We  want  to  choose  the 


terms  / (2)  and  g(l)  such  that  the  norm  of  the  above  error  is  minimized. 


r(D 


Note  that 


fjl  2A 

-d) 
\S  J 


is  in  the  range  of  the  mapping,  i.e.  it  satisfies  the 


homological  equations 
f(2\x)  =  [Fjc,<})(2)U)]  +  Ga(2)(jc) 


(3.68a) 


g(]\x)u  =  [Gm,0w(jc)]  +  G|3vv(jc)m 


(2), 


id)/ 


V  constant  u. 


(3.68b) 


Furthermore,  we  wish  to  choose  the  smallest 


that  will  achieve  the 


a(2) 


vp(1V 

above.  Again,  we  choose  positive  definite  matrices  S ,  R  and  minimize 


a 


l<j)(2)l2  +  la(2)(x)  +  {3(1)(jc)m  \2p(x,u)dxdu 

o  R 


(3.69) 


or  one  can  take  a  weighted  combination  of  the  above.  In  Figs.  (2.1)  and 
(2.2)  we  illustrate  the  above: 


Fig.  2.1  represents  the  n2(n  +  l)/2  +  n2  dimensional  parameter  space  for 
the  range  of  the  mapping.  The  second  degree  terms  in  the  given  control 


system  define  a  point  in  this  space,  denoted  by 


The  range  of  L  is 


48 


represented  by  a  straight  line  going  through  the  origin.  Those  points  in  the 
range  space  of  L  that  exactly  satisfy  (3.66)  will  lie  on  this  line.  Among 

these  infinitely  many  points  we  want  to  find  the  one  (shown  as  _(1)  on  the 

U  > 

figure)  which  will  minimize, with  respect  to  a  norm  defined  earlier,  the 
error  between  the  actual  system  that  is  being  approximately  linearized  and 
a  model  which  is  exactly  linearizable  (up  to  degree  2)  by  the  coordinate 
change  and  feedback. 


Fig.  2.2  Domain  of  the  mapping 

Fig.  2.2  illustrates  the  n  (n+ 1)/2  +  n(n+l)/2  +  n  dimensional  domain  space 
of  the  mapping  ,  and  the  minimization  done  in  the  domain  space. 


The  central  issue  in  this  problem  is  how  to  define  the  appropriate 
metric  to  minimize  the  error.  To  this  end,  we  assume  that  the  states  and 
the  input  have  been  scaled  by  their  characteristic  values.  Then,  the 
probability  distribution  function  p (x,u)  in  the  integrals  may  be  assumed  to 
have  zero  mean  and  unit  covariance.  With  this  assumption,  the  matrices  Q, 
R,  and  S  in  the  integrals  of  (3.67)  and  (3.69)  can  be  approximated  by 
identity  matrices.  We  assume  a  basis  for  vector  valued  monomials  of 
degree  2  and  express  the  vector  monomials  as 

<J)(2)(jO  =  4>^<p;y  (x)  (3.70) 

where  cp*.  =  vkxiXj  and  v*  is  the  unit  column  vector  along  the  kth 
coordinate  direction  (see  eqn.  4.13).  Now  the  error  terms  in  the  integrals 
(3.67)  and  (3.69)  may  be  easily  computed.  For  instance,  considering  a 
term  in  (3.69) 

\W2)W2S  =  Wj/  tf/Sityjj  <p*;) 
the  integral  (3.69)  becomes: 

J* 5"  **<t)j[7( |>£-  ^XtXjXjXjpix.iOdxdu 


If  we  assume  x  =  0  and  5  =  /,  the  identity  matrix,  the  above  integral  is: 

1  x[xrx-.x7p(x,u)dxdu  =  +  Pt  -PtJ  +  Pi-Pn  (3.71) 


The  evaluation  of  the  integral  (3.71)  reduces  to  the  simple  calculation  of 
the  fourth  central  moments  of  a  probability  density  function  around  zero 
mean  since  p(;c,w)  is  assumed  to  have  zero  mean  and  unit  covariance  in  the 
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scaled  coordinates.  Even  though  the  matrix  obtained  from  the  above 
fourth  moments  is  not  diagonal,  it  is  assumed  to  be  approximately  equal  to 
the  identity.  This  assumption  greatly  simplifies  the  numerical  calculations, 
and  it  does  not  introduce  a  large  deviation  from  the  actual  value  of  the 
integral  being  minimized. 


To  solve  the  linear  set  of  equations 


L 


(3.66) 


we  use  a  singular  value  decomposition  procedure  as  follows.  First  the 
matrix  L  in  (3.66)  is  decomposed  as 

L  =  UZy7  (3.72) 

U  and  V  in  the  above  are  orthogonal  matrices  and 


where  the  matrix  Er  is 


i.e.  it  contains  the  r  singular  values  of  L  along  its  main  diagonal  and  is  zero 
elsewhere.  Obviously,  the  above  development  assumes  that  the  rank  of  L  is 
equal  to  r,  which  is  in  general  less  than  the  number  of  columns  in  the 


matrix  L.  If  we  denote  the  equation  (3.66)  to  be  solved  as  as  Lx  =  b  for 
brevity,  we  have: 


\\Lx-b\\2  =  ||  UYVTx-b\\2 

=  ||  U(£VTx-d)  ||2  (3.73) 

where  d  is  defined  by  Ud  =  b.  Also  define  a  new  unknown 

y  =  VTx.  (3.74) 

We  note  that  multiplting  a  vector  with  an  orthogonal  matrix  (U  and  V  in 
this  problem)  leaves  the  norm  of  the  vector  invariant.  Then  the  above 
becomes: 

||^-b||2=||  U{Ly~d  )||2  =  |  -d  ||2  (3.75) 

The  minimizing  solution  is  found  as: 
y(  =  dt  /o  •  for  i  <  r 

yt  =  0  for  i  >  r. 

Once  the  values  of  y  are  calculated,  the  original  unknowns  x  are  obtained 
by  an  additional  matrix  multiplication  by  V  (see  (3.74)).  This  yields  the 
least  square  solution  for  the  nonlinear  coordinate  change  and  feedback. 


3.6.  Linearization  for  Tracking  and  Estimation 

In  this  section  we  will  consider  a  slightly  different  control  problem.  The 
development  included  in  this  section  is  beyond  the  scope  of  this  report,  and 


is  included  here  for  the  sake  of  completeness.  This  work  is  due  to  Arthur 
J.  Krener  and  Andrew  Phelps.  Suppose  that  the  control  problem  is  the 
tracking  of  a  reference  signal,  i.e.  we  have  a  p -dimensional  output  signal  y 
(or  its  series  expansion  at  the  nominal  point)  and  a  reference  r(t): 

y  =  h(x)  =  Hx  +  h{1\x)  +  0(x)<3)  (3.76) 

and  the  goal  is  to  achieve 

y(t)  -  r(t)  ->  0.  (3.77) 

In  the  following,  the  linearization  problem  for  degree  2  terms  is  treated. 
We  assume  that  in  the  problem  the  estimation  of  the  states  is  also  required. 


Along  with  Eqn.  (3.23)  we  have  (3.76): 

x  —  Fx  +f\x)  +  (G  +  gW(x))u  +  0(x,«)3  (3.78a) 

y  =  Hx  +  hi2\x)  +  O(x)0)  (3.78b) 

Then  we  consider  a  coordinate  change  on  the  states  as  well  as  on  the 
output: 

z  —  x  —  <J>(2)(jc)  (3.79a) 

W  =  y-  \|/(2)Cy)  (3.79b) 

and  a  new  input  v  is  defined  as 

v  =  a(2)O0  +  (G  +  p(1)O0)u  (3.80) 


Note  that  in  (3.80),  only  the  available  output  is  used  for  feedback.  With 
the  above  feedback-coordinate  transform  pair  we  want  to  obtain 
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z  =  Fz  +  v  +  0(z,v)3  (3.81a) 

w  =  Hz.  (3.81b) 

in  the  new  state  coordinates  z  and  the  new  outputs  w.  The  following 
development  is  similar  to  the  derivations  done  in  Sec.  3.2.  We  take  the 
time  derivative  of  the  coordinate  transformation  (3.79a)  and  introduce 
(3.79a, b),  (3.80),  (3.81)  on  each  side: 

Fix  -  A*))  +  a(2)Cv)  +  (G  +  p0)(y))« 

=  a  -  )(Fx  +f\x)  +  (G  +  g(%))u  ) 


A  calculation  for  the  output  gives,  using  (3.79a,b)  and  (3.81) 

H(x  -  A*))  =  Hx  +  hm(x)  -  v/&\Hx)  +  0(x)3 

By  expanding  and  rearranging  the  terms  in  the  above  equations,  and  using 
the  Lie  bracket  notation  we  obtain  the  Generalized  Homological  equations: 

f\x)  =  [Fx$(2\x)]  +  a  {2\Hx)  (3.82a) 

g(1)0c)w  =  [Gu,ty(2\x)]  +  $(1\Hx)u  V  constant  u.  (3.82b) 

h(2\x)  =  \|/(2)(//jc)  -  H  <D(2)W  (3.82c) 


These  equations  define  a  linear  mapping  as  follows: 


r  A*) ' 


< 


y7>(Hx) 

a2)(llx) 


0)(Hx) 


f\x) 

g(,\x) 

-A*) 


J 


(3.83) 


A  dimension  count  gives  the  dimensions  of  the  domain  and  the  range: 

(3.84) 

The  following  table  shows  the  dimensions  of  the  domain  and  the  range  for 
various  cases: 


For  m  =  1,  p  =  1: 

For  m  =1,  p  =  2: 

State  Space  Domain 

-.Range 

State  Space  Domain 

Range 

n  =  2  11 

13 

n-2  22 

16 

n  =  3  25 

33 

n  -  3  39 

39 

n=  4  49 

66 

n  =  4  66 

76 

For  m  =  l,p  =  3: 

State  SDace  Domain 

Range 

For  m  =2,  p  =  2: 

State  Space  Domain 

Range 

n  =  2 

— 

— 

n-2 

26 

20 

n  =  3 

63 

45 

n  =  3 

45 

48 

n  =  4 

94 

86 

M  =  4 

74 

92 

n  =  5 

138 

145 

. 

• 

• 

As  seen  in  the  tables,  for  larger  state  space  dimensions  and  fewer  input- 
output  pairs  the  mapping  is  rank  deficient.  Again,  one  sets  up  a  least 
square  problem  for  finding  a  solution,  similar  to  Sect.  3.4. 


The  linearization  method  can  be  extended  to  the  problem  of 
estimation  of  the  unavailable  states  as  follows.  Suppose  we  have 


z~Fz  +  <x(2)(w-)  +  (G  +  P(1>(w))h  (3.85a) 

w  =  Hz.  (3.85b) 

Then  we  set  up  an  observer 

/  =  F  £  -  L(w  -  H  &)  +  a (2)(w)  +  (G  +  [5(1)(w))w  (3.86) 

so  that  the  error  is 

'z  =  (F+  LH)  z  (3.87) 

and  the  gain  L  is  chosen  to  achieve  a  stable  dynamics  for  the  observer. 
Considering  the  estimated  states,  we  define  a  coordinate  transformation 

*  =  £  +  <J>(2)(f)  (3.90) 


Then,  with  the  above,  the  error  dynamics  in  x  coordinates  becomes 
&  =  (I  +  - L(w  -H  &)  +  a(2\w)  +  (G  +  $(l\w)))u) 

or 

£  =  F{St  +  <|)(2)( St))  -Liy-  \|/(2)(y)  -  //( St  +  <|>(2)( St)))  +  a (2)(y)  +  Gu 

(2) 

+  p<1)0')«  +  '^jjp-(Fl!-L(y-ym(y)-Ht)  +  Gu)  (3.91) 


Rewriting  the  above  with  the  Lie  bracket  notation  we  finally  get 
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£=F$-L(y-H&)  +  Gu  +  [F  J&,<j>(2)(  £)]  -  [L(y  -  H  £),<j>(2)(  £)] 

+  [Gu,<|>(2)(  &)]  +  a  (2)O0  +  p(1)Cy)M  +  Ly(2)(y)  (3.92) 

Eqn.  (3.92)  is  the  observer  for 

x  =  Fx  +  Gu  +/2\x)  +  g(1)0c)u  (3.93a) 

y  =  Hx  +  /i(2)(jc)  (3.93b) 

Notice  that  the  linear  part  of  the  observer  agrees  with  standard  practice  and 
the  second  degree  part  is  the  correction.  In  the  closed  loop  we  have  the 
feedback 

u  =  {  P(1)(^)/srj^  +  AT<t><2)(^)  +  cx(2)(J^)}  (3.94) 


as  expected. 
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4.  ANALYSIS  OF  THE  TRANSFORMATIONS 

4.1.  Introduction 

In  Chapter  3,  we  derived  the  generalized  homological  equations  fpr  the 
second  degree  linearization  problem  and  introduced  an  equivalent  set  of 
linear  equations.  A  solution  to  this  set  of  linear  equations,  if  it  exists,  will 
yield  the  coefficients  of  the  polynomials  in  the  coordinate  transformation- 
feedback  pair.  In  this  section,  we  first  show  that  the  mapping  is  rank 
deficient.  We  compute  both  the  kernel  and  the  co-kemel  of  the  mapping. 
We  then  describe  a  solution  method  in  detail  aided  by  the  insight  gained 
through  this  analysis. 

A  computer  program  written  in  the  MATLAB  package  that  solves 
for  the  transformations  based  on  the  analysis  of  this  chapter  is  presented  in 
Ch.  5. 

4.2.  Kernel  of  the  mapping 

In  Section  3.2,  the  generalized  homological  equations  were  derived  as: 
fm(x)  =  ;fx,<|><2,«]  +  Ga(2\x)  (4.1a) 

g  =  [Gu, <t>,2)Ct)l  +  GP(1IU)« 


V  constant  u. 


(4.1b) 
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It  was  shown  that  these  equations  describe  a  mapping  for  the  monomial 
coefficients  of  the  terms  in  (4.1): 


(4.2) 


Furthermore,  we  obtained  a  set  of  linear  equations  equivalent  to  this 
mapping,  expressed  in  matrix  form: 


The  derivation  of  the  exact  form  of  the  constant  coefficient  matrix  L  in  the 
above  equation  will  be  discussed  later.  At  this  point  we  assume  that  we 
have  a  nonlinear  system  in  which  the  linear  part  of  the  vector  field  is  in 
Brunovsky  canonical  form: 

x  =  Ax  +f\x)  +  (B  +  g(l\x))u  +  0(x,u)3  (4.4) 


where  A,  B  are  matrices  of  the  prime  triple  (A,  B,C)  (see  Eqns.  (2.3a,b)). 
As  a  matter  of  fact,  if  the  linear  part  of  a  given  system  is  controllable,  one 
can  always  obtain  (4.4)  with  some  appropriate  linear  coordinate  change  and 
feedback.  Furthermore,  we  also  assume  that  the  system  (4.4)  is 
linearizable  up  to  second  degree  in  accr  dance  with  Krener's  Theorem  for 
approximate  linearization  [29]  as  presented  in  Chapter  2.  Then  one  can 
linearize  this  control  system  with  a  coordinate  transformation  and  feedback 
pair 
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* = *  -  <t>®w  (4.5) 

v  =  af(x)  +  (I  +  P<I1)M)«  (4.6) 

to  obtain 

2  =  Az  +  B  v  +  0(z,v)3.  (4.7) 

We  pose  the  following  question  at  this  point:  Can  one  find  some  other 
coordinate  transformation-feedback  pair: 

£  =  z  -  $2  \z)  (4.8) 

U  =  af(2)  +  (I  +  p‘21)(z))v  (4.9) 

similar  in  form  to  those  of  Eqns.  (4.5)  and  (4.6)  such  that,  after  (4.7)  is 
transformed  by  the  above  one  obtains,  in  £  coordinates 

£  =  A?  +  B  H  +  0(£,|i)3  (4.10) 


i.e.  another  linear  system  (up  to  degree  2)?  If  such  a  transformation  and 
feedback  pair  (4.8)-(4.9)  exist,  then  the  original  linearizing  pair  (4.5)-(4.6) 
is  obviously  not  unique,  since  a  combination  of  both  will  yield  (ignoring 
cubic  terms) 

o"’<x)  (4.11) 

(4.12) 

To  show  such  a  transformation  is  indeed  possible,  we  first  choose  the 
"natural"  basis  for  expressing  the  first  and  second  degree  monomials. 
Choose  as  a  basis  the  unit  column  vectors  v*,  k  =  1,  ...,  n: 
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z  =  x  -  ^\x)  (4.5) 

V  =  af(x)  +  (I  +  ^\x))u  (4.6) 

to  obtain 

z  =  Az  +  B  v  +  0(z,v)3.  (4.7) 

We  pose  the  following  question  at  this  point:  Can  one  find  some  other 
coordinate  transformation-feedback  pair: 

C  =  z  -  <t>(2>(z)  (4.8) 

H  =  af(z)  +  (I  +  p<21)(z))v  (4.9) 

similar  in  form  to  those  of  Eqns.  (4.5)  and  (4.6)  such  that,  after  (4.7)  is 
transformed  by  the  above  one  obtains,  in  £  coordinates 

C  =  A?  +  B  n  +  0(£,n)3  (4.10) 


i.e.  another  linear  system  (up  to  degree  2)?  If  such  a  transformation  and 
feedback  pair  (4.8)-(4.9)  exist,  then  the  original  linearizing  pair  (4.5)-(4.6) 
is  obviously  not  unique,  since  a  combination  of  both  will  yield  (ignoring 
cubic  terms) 

?  =  *-<! >32)W  (4.11) 

n  =  a(2,«  +  (I  +  p(3l \xj)u  (4.12) 

To  show  such  a  transformation  is  indeed  possible,  we  first  choose  the 
"natural"  basis  for  expressing  the  first  and  second  degree  monomials. 
Choose  as  a  basis  the  unit  column  vectors  v*,  k  =  1, ...»  n: 
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v*  = 


'0 


L0  J 


<—  k  th  entry 


(4.13) 


Similarly  choose  as  a  co-basis  the  unit  row  vectors  wi ,,  i  =  1, 

wf=  [0  •  •  1  •  •  0]  (4.14) 

with  unity  in  the  i  th  entry  and  zero  everywhere  else.  As  in  Chapter  3,  we 
define  a  basis  for  n-dimensional  vector  valued  functions  of  homogeneous 


polynomials  of  degree  2  and  degree  1  as  follows: 

(p  fa)  =  vk(wiz)(wjz )  fory,  k  =  1,  ...,n  ;  i  =  1,  ...  J.  (4.15a) 

q*r)  =  ^(w?)  for  j  =  1,  ...,n  ;  i  =  1,  (4.15b) 

Similarly,  a  basis  for  scalar-valued  second  degree  polynomials  is  chosen  as: 
htfz)  =  {wtz){wjz)  forj  =  1,  ...,n  ;  /  =  1,  ...J.  (4.16) 

also  for  the  first  degree  terms: 

ht(z)  =  wp  for  /  =  1,  ...,«.  (4.17) 

Now  we  recall  the  proof  of  Krener's  theorem  for  approximate 


linearization.  We  choose  a  scalar  function  h(z),  and  take  this  function  and 
its  Lie  derivatives  as  the  desired  new  coordinates  in  (4.8): 

h(z)  =  y£aihi(z)  +  ’*£aiihifz) 
i=l  1  <i<j 


(4.18) 


Cl  =  h(z) 
c2  =  Ljh(z) 
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(4.19) 

C,=  if'AW 

Comparison  of  the  above  with  (4.5)  implies  that,  since  the  two  coordinates 
z  and  C  have  to  agree  in  their  first  degree  terms,  in  (4.18)  a1  =  1  and  a *  = 
0  for  i  =  2,  ...,  n.  The  coefficients  of  the  second  degree  terms  are  yet  to 

be  determined.  Obviously,  if  all  of  the  terms  a*V  in  (4.18)  are  zero,  then 
the  only  transformation  that  takes  (4.7)  into  (4.10)  is  the  identity,  with 

<|>2  (z)  =  0  in  (4.8).  We  will  show  that  one  can  find  a  scalar  function  h(z) 
with  nonzero  second  degree  terms  (and  therefore  nonzero  (j>  2  (z))  to 

achieve  the  above  transformation.  According  to  the  Hunt-Su  linearization 
theorem  a  transformation  will  exist  if  (see  Chapter  2) 

,  \  0  1  <  r  <  n 

L(Lrf'h(z))  =  (4.20) 

*  1  1*0  r  =  n 


With  the  aid  of  the  chosen  basis,  we  proceed  to  calculate  the  first  n  -  1 
equations  in  (4.20).  First  note  that,  since  there  are  no  second  degree  terms 
in  the  given  control  system  (4.7),  /  and  g  are  equal  to  Az  and  B, 
respectively.  It  is  also  clear  that  (4.20)  will  be  satisfied  if  h  and  its  first  n  - 
1  Lie  derivatives  along /are  not  a  function  of  zn,  i.e. 


if  \y(z)  =  \|f(zj,  z2, zn_ j)  because  the  n  th  entry  in  the  above  one-form 
will  then  vanish.  If  we  assume  that 


h(z)  =  z1  +  a11zl2 


(4.22) 


one  can  easily  show  by  calculating  the  repeated  Lie  derivatives  that  the 
term  zn  will  not  appear  until  the  n  -  1st  derivative.  In  fact,  any  quadratic 
term  in  h  that  is  a  function  of  z,-  will  cause  Lrj~lh(z )  to  be  a  function  of  zn  at 


the  n  -  i  th  derivative.  Therefore  we  establish  that  there  is  a  one- 


parameter  family  of  solutions  dependent  on  the  choice  of  a1*1  in  (4.22)  that 


will  yield  additional  solutions.  Note  that  this  solution  does  not  redefine  the 
input,  and  both  a ^(z)  and  (^(z)  in  Eqn.  (4.9)  are  zero.  To  make  the 


explanation  more  precise,  we  have  found  a  nonzero  vector 


(2) 


a 


which, 


when  added  to  the  original  solution  in  Eqn.  (4.3),  will  not  change  the  right- 
hand  side  of  (4.3).  In  other  words  the  solution  found  belongs  to  the  kernel 
of  the  mapping.  For  a  single  input  system,  (4.22)  is  the  only  possible 
function  which  results  in  a  transformation  that  belongs  to  the  kernel  of  the 
mapping,  and  the  dimension  of  the  kernel  is  equal  to  one. 


4.3.  Co-kernel  of  the  mapping 

In  Chapter  2,  it  was  shown  (via  Krener's  approximate  linearization 
theorem)  that  a  controllable  system 

x=f(x)  +  g(x)u  =  Ax  +/2\x)  +  (B  +  g(1)(x))w  +  0(x,m)3  (4.23) 


is  linearizable  up  to  degree  2  around  x  =  x°  iff  the  distribution 

An-1  =  span  {£(*<>),  g(*°)} 


(4.24) 
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is  involutive  up  to  degree  2.  In  [Krener 1 984a]  it  is  also  proven  that  this  is 
equivalent  to  the  integrability  condition.  For  a  single  input  system  there 
exists  a  nonzero  function  h(x)  (precisely  of  the  form  (4.18))  such  that 

<dh,  adf  g(x)>  =  0(;t)2.  (4.25) 

i.e.  constant  and  first  degree  terms  in  (4.25)  must  vanish  for  a  nonzero  h. 
When  (4.25)  is  not  satisfied,  a  system  is  not  exactly  linearizable  up  to 
degree  2,  and  consequently  an  exact  solution  to 


does  not  exist.  In  this  section,  we  will  attempt  to  find  which  terms  (or 
linear  combinations  of  terms)  in  the  second  degree  part  of  the  vector  field 
cause  the  system  to  be  not  linearizable  (or  equivalently,  non-involutive).  In 

linear  algebraic  terms,  we  are  looking  for  a  specific  vector  or  vectors 

(the  entries  of  this  vector  are  the  coefficients  of  the  second  degree  terms  of 
the  control  system  in  a  particular  basis)  which  will  satisfy  the  adjoint 
equation 


=  0. 


(4.26) 
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The  precise  definition  of  the  co-kernel  of  the  mapping  is  given  by  (4.26). 
However,  once  the  generalized  homological  equations  are  expressed  in 
their  equivalent  linear  form  (4.3),  the  insight  given  by  the  integrability  (or 
involutiveness)  condition  becomes  lost.  Therefore  we  will  try  to  derive  the 
co-kernel  expressions  using  Lie  derivatives  and  by  finding  the  conditions 
that  satisfy  (4.25). 

Eqn.  (4.25)  implies  (see  the  Hunt-Su  theorem  in  Chapter  2) 
dh  1  UU),  adf  g(x),  ...,  ad/~2  g(x)}  up  to  0(x)2.  (4.27) 

Our  derivation  of  the  co-kernel  equations  will  be  based  on  calculation  of 
the  successive  Lie  derivatives  of  g(x)  along /(x),  which  appear  inside  the 
bracket  in  (4.27).  To  simplify  the  expressions,  we  will  first  make  a 
coordinate  transformation  that  will  eliminate  only  the  terms  g(1)(x)w  in 
(4.23).  We  choose  a  coordinate  change 

2  =  c  -  <j>(2)(x)  (4.28) 

and  no  change  in  the  feedback  (which  means  a(2)(x)  and  (3(1)(x)  will  be 
zer  )  in  order  to  satisfy  only  the  second  generalized  homological  equation 
(4.1b) 

*<lirl)=  [B,(D(2>«]  (4.29) 


where  the  input  u  has  been  cancelled  on  both  sides.  We  express  the  left- 
hand  side  of  (4.29)  in  the  basis  defined  in  Section  4.1  (4.15b): 

*(1)M=  X  Si  <P*  (4.30) 
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where  glk  are  the  constant  coefficients  of  g(1)(je)  in  this  basis.  Similarly, 
express  <}>(2)(jc)  in  the  same  basis  defined  in  (4.15a): 

<J>(2)(*)  =  Hj  (Po  (4.31) 

where  <J )jj  are  the  unknown  coefficients  of  the  second  degree  terms.  The 
bracket  on  the  right  hand  side  of  (4.29)  is  calculated  using  (4.31),  noting 
that  B  =  vn  in  the  same  basis  is  given  by 

[B,|)>(2)W]  =  v"  =  tjt  vl((wix)wj  +  (w^)w,.)v" 

=  Id  +  5;  (4.32) 

where  8j  is  the  Kronecker  delta  function,  and  a  summation  over  the  indices 
i,  k  is  implied.  Setting  equal  the  coefficients  of  the  basis  elements  <p*  in 
(4.32)  and  (4.30),  we  obtain: 

§lkn  =  £*/(l  +  5*  )  for  i,  k  =  1,  ...,  n.  (4.33a) 

and 

<J>^'  =  0  for  j  <  n.  (4.33b) 

Thus  we  have  determined  the  form  of  (4.31),  and  the  transformation 
(4.28),  which  will  transform  the  system  into: 

z  =  Az  +  Bu  +/2\z)  +  0(z,m)3  (4.34) 

Note  that  the  coefficients  of  /2)(z)  in  (4.34)  are  not  the  same  as  the 
coefficients  of  /2\x)  in  (4.23),  because  the  transformation  (4.28) 
contributes  new  quadratic  terms  to  the  vector  field.  These  new  terms  may 
be  calculated  via  the  generalized  homological  equation  (4.1a).  Since  the 


derivation  of  the  co-kemel  equations  is  identical,  we  shall  not  calculate  die 
resulting  change. 


Using  (4.34),  Eqn.  (4.27)  may  be  rewritten  as: 
dh  .1  {B,  adjB, ...,  adfn'2B )  up  to  O(z)2  (4.35) 

where  f-Az  +/2\z).  Eqn.  (4.35)  is  the  same  as 

Ladr~'B(h)  =  O(z)2  for  r  =1,  ...,  n  -1.  (4.36) 

We  use  the  well-known  iterative  formula: 

Lad^g(h)  =  Lad'-ig  Lj(h)  -  LfLadr-ig(h)  (4.37) 

which  can  be  written  as: 

57  adlf'B  }  (Az  +/<2))  -  Jj{|:  (Az  +/2)) }  adJf-'B  =  0.  (4.38) 


First,  the  terms  in  (4.38)  are  expressed  in  the  monomial  basis: 


fm  =1 

9- 

=flk'ivk(wiz)(wjz) 

(4.39a) 

B  =  v" 

(4.39b) 

v1"1 

for  2  <  i  <  n 

Av 1  =  • 

(4.39c) 

0 

for  i  =  1 

wi+  i 

for  1  <  i  <  n  - 1 

wfl  = 

(4.39d) 

.0 

for  i  =  n 

h  =  hx 

+  /i'V  = 

w  jZ  +  hi'i(wiz)(\VjZ ) 

(4.39e) 

Then  we  calculate  the  Lie  brackets  using  (4.36)  and  (4.37): 
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adfB  =  [B,  ( Az  +/2))]  =  [vfl,  (Az  +fkJvk(wiz)(wjz))) 

=  (a  +fikJvk((wiz)wj  +  (wjz)wi))vn 

=  v"'1  +  fk-n  v*(l  +  8*  )(w,.z)  (4.40) 

ad^B  =  adfadj-B  ) 

=  (a  +fk'ivk((wiz)wj  +  (wjz)wi))(vn~1+fik‘n  v*(l  +  8*  XWfZ)) 

-  fkn  v*(l  +  8*  )w/lz 

=  Vn~2+fk-n  v^Q+S*  )(wtz)  +fjc'n~l  Al+8nii  )(wtz)  -fik-n  Al+8i)K+1z) 

=  Vw_2  +  1+5*  )  +  /J*n-1(l+8ni1  )  -flk  l’n  (l+8‘  ))(wfz)  (4.41) 

When  the  above  calculations  are  repeated  for  more  steps,  a  general  formula 
can  be  written  for  adfB  as  follows: 

adfB  =  v"  r  +  X  X{  X  vt+i"r4"-'-1(l  +  8 „4+1  ) 

/=1  Jk=l  i=  1 

+  X  ^(-l)?+s  (’)  vk-sfi-t+s*-r+,.i  (1  +  J ) }  (W/K4.42) 

Note  that  in  the  calculation  of  (4.40)  through  (4.42)  O(z)2  and  higher  terms 
have  been  ignored.  Since  we  are  trying  to  calculate 

<dh,adfl  B>  =  O(z)2  (4.43) 

we  evaluate  dh  using  the  expression  (4.39e): 

^-  =  w,  +  tijyiw^Wj  +  ( wJz)wi )  (4.44) 
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Now,  (4.43)  must  be  obtained  by  multiplying  (i.e.  taking  the  inner  product 
of)  (4.44)  and  (4.42)  and  keeping  only  the  constant  and  O(z)1  terms.  This 
yields: 


<dh,adrB>  =  £  { +  S' )  +  £  ffi*1  (1  +  8j_.+1 ) 


+  £  £(-d,+s  a + > }  (*v>  (4-45> 


q=l  s=0 


Then,  using  the  above  equation,  one  sets  the  terms  inside  the  curly  brackets 
{ }  equal  to  zero  for  j  =  1, ...,  n.  Solutions  to  these  equations  yield  the  co- 
kernel  equations.  Because  of  the  complicated  nature  of  Eqn.  (4.45),  the 
calculation  of  the  co-kemel  equations  become  more  difficult  as  the  order  of 
the  system  increases.  However,  this  fact  does  not  make  the  numerical 
calculation  of  the  actual  linearizing  solution  (or  an  approximate 
linearization)  any  more  difficult,  since  the  exact  form  of  the  co-kemels  is 
not  necessary  for  a  numerical  solution. 


4.4  Derivation  of  the  equivalent  linear  system  of  equations  for 
the  solution  with  a  computer  program 

In  the  computer  program,  we  will  solve  the  following  linear  system: 


In  (4.46),  the  coefficients  of  the  second  degree  terms  in  the  vectqr  field 
f{1\x)  and  g^\x)u  are  obtained  using  the  natural  basis  introduced  in 
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Section  4.2,  and  stacked  in  the  right  hand  side  vector  in  a  consistent 
lexicographic  ordering.  Similarly,  unknown  coefficients  of  the  terms 
<j)(2)(x),  a(2)(jc),  and  p(1)(x)  are  stacked  in  the  left  hand  side  vector  of 
unknowns  in  the  same  manner.  The  coefficient  matrix  L  in  (4.46)  is 
constant,  and  it  is  obtained  from  the  set  of  generalized  homological 
equations: 

fa\x)  =  IF. JC,<(><2)(JC)]  +  Ga(2\x)  (4.47a) 

g  l>(x)u  =  [Ga,<j/2'(-t)]  +  Gpll>(.r)M  V  constant  u.  (4.47b) 

When  the  terms  in  (4.47)  are  expressed  in  the  natural  basis  and  calculated, 
we  get  the  following: 

[Fx,<j>(2)(*)]  =  [Fx^j]  =  j((wp)Wj  +  (WjX)w^Fx  -  Fvi(wIz)(wyz)  (4.48) 


We  note  that: 

n 

w j  F  =  X  F:lwl 

(4.49a) 

/= l 

Fvk  =  X  F(kvl 

(4.49b) 

/=i 

where  Fj  is  the  i,  j  th  entry  in  the  matrix  F.  Then  one  can  write,  for  each 
element  of  the  monomial  basis  <p*y., 


[Fxjfij]  =  X  vk{Fjl(wix)(wlx)+  Fil(wjx)(wl  x))  -  Ffvl(yvp)(wjz) 
=  £  F'Vu  +  F'tfi  -Ffv'j 


(4.50) 


The  calculation  of  the  constant  coefficient  matrix  L  in  Eqn.  (4.46)  is  partly 
based  on  the  result  obtained  in  (4.50)  as  follows:  The  dimension  of  the 
monomial  basis  <p*y  is  equal  to  ~  ^  .  We  restrict  the  ordering  of  the 


indices  as  l  <  i  <  j  <  n;  1  <  k  <  n  and  note  that  in  (4.50)  under  the 
summation  of  index  l  we  have  to  interchange  the  subscript  indices  *,  l  or  j,  l 
for  the  first  two  terms  in  the  summation  in  order  to  keep  the  ordering  of 
the  basis  elements  consistent.  This  will  yield  the  new  expression: 


S  Fj  <Pi/  +  i  F/tfj  +  i  F/tf,  -Z  Ffufij 

(4.51) 


l=i+ 1 


M 


/=;+! 


/=! 


where  a  summation  over  indices  \<i<j<n\\<k<n  is  implied. 
Then,  we  collect  the  terms  in  the  right  hand  side  of  Eqn.  (4.51)  under 
dummy  indices  with  overbars  l,J,k  or,  in  other  words,  regroup  the 
terms  under  a  monomial  <p*  j  and  sum  over  the  indices  i,j,k  to  get: 


t^4i=  {(  Sfj8? 


i  =1 


7=i+i 


F]  8- 

1  j 


7=  1 


F'iVi 


£  ^s;)8£ 

i 


-  fUtv.  jpi, 

k  i  J  1  J 


(4.52) 


where  8j-  is  the  Kronecker  delta  function.  Precisely  speaking,  given  the 
reverse  lexicographic  ordering  1  <  i  <j  <  n,  1  <  k  <  n  for  the  entries  in 
the  linear  operator  representation  of  the  homological  equation,  Eqn.  (4.52) 
will  yield  the  value  of  the  (z,Jjfe)th  row-(/J,&)th  column  entry  in  the 
matrix  L  of  Eqn.  (4.46).  To  be  able  to  write  the  right-hand  side  of  the 
linear  set  of  equations,  we  express  the  second  degree  terms/  (x)  as: 
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fmw  = 


?,/F<47 

i<7^7  1 

\<k 


(4.53) 


Note  that  Eqn.  (4.52)  will  yield  only  some  of  the  entries  in  L;  for  finding 
the  entire  matrix,  we  need  to  consider  all  the  terms  present  in  the 
generalized  homological  equations.  To  this  end,  we  now  define  a  unit 
vector  for  m  xl  dimensional  vector  valued  second  degree  monomials  to  aid 
m  expressing  a  (jc): 

0 


Qfj(x)  = 


1 


Lo  J 


(wpcXWjX) 


1  <i<j<n\  1  <\<m. 


(4.54) 


with  the  m  xl  column  vector  equal  to  unity  in  the  A,th  entry,  and  zero 

(2) 

elsewhere.  The  expression  of  a  (x)  in  this  basis  is: 
am(x)=  I  aye* 


1  <i<j<n 
1  <\<m 


(4.55) 


(2) 

Thus,  a  unit  term  corresponding  to  Ga  (x)  in  the  homological  equation 
(4.47a)  becomes,  with  the  aid  of  a  summation. 


ce*  =  I_  (G^  8-87)4 

1 <i£j  J 

1  <k 


(4.56) 


Next  we  define  the  following  unit  vectors  for  a  similar  calculation  of  the 
second  generalized  homological  equation  (4.47b).  Define  the  unit  n  xl 
dimensional  vector  valued  first  degree  monomials  as  before: 


(4.57) 
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<p*CO  =  V*(Wr*) 


and  the  m  xl  dimensional  vector  valued  monomials  in  (jc,u): 
■q  nm  xl 


nf  (x,u)  = 


1 


LO  J 


1  <  i  <  n  ;  1  <  X,,  p.  <  m. 


(4.58) 


with  the  m  xl  column  vector  equal  to  unity  in  the  A,th  entry,  and  zero 
elsewhere.  Equation  (4.47b) 

gW(x)u  =  [Gw,<|>(2)(jc)]  +  G$(l\x)u  V  constant  u.  (4.47b) 

can  now  be  calculated  by  using  the  unit  vectors  as  defined  previously: 

m 

[Guwlj]  =  X  g/Vp;  +  Gj\^ki 

u=\ 


=  I  ( c," s;  +  an]  4  >  <p* «_ 

i<u<»  ‘  1  k  ’  '  *  1  " 

l<£<m 

In  addition,  the  left-hand  side  of  Eqn.(4.47b)  is  expressed  as: 

£0)(*)m  =  X  ^VTWn 

1  <Jt<rr 

We  also  rewrite  the  term  (}(1)(jc)m  as  follows: 

${ ]\x)u  =  X  $lxnU 
1<<<«  K  M 


(4.59) 


(4.60) 


(4.61) 
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and  G$^\x)u  is  represented  as: 

Gn^=  I  (4.62) 

1<j  ,k<n  M 

1</Z<m 

Combining  all  of  the  above,  we  can  calculate  the  coefficient  matrix  L.  We 
summarize  these  results  in  tabular  form: 


p 

Pi" 

Eqn.  (4.43)  g£8]S} 


0 


G?8j8£+G/8j8f  0 


yi(yi  +  1 ) 

A  dimension  count  will  yield  the  size  of  the  above  matrix  as  —  2 - ^  + 

2  ,  n2(n  +  1)  mn(n  +  1)  7  ,  „  „ 

n  m  rows  by  —  2 -  + -  2 -  +  m  n  columns,  as  found  before. 


The  above  table  yields  the  entries  of  the  coefficient  matrix  in  the  linear 
system  of  equations. 
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5.  COMPUTER  PROGRAM  AND  SIMULATIONS 
5.1.  Introduction 

A  computer  program  consisting  of  a  collection  of  routines,  or  M-files, 
written  in  the  MATLAB  application  language  was  prepared  to  implement 
the  results  of  this  report.  The  package  is  intended  for  use  along  with  the 
MATLAB  program,  and  with  the  Control  System  Toolbox  for  MATLAB. 
It  is  used  for  calculating  nonlinear  coordinate  transformation  and  feedback 
pairs  to  linearize  control  systems  up  to  a  specified  degree  in  the  series 
expansion  of  the  nonlinear  terms.  The  current  version  of  the  program 
calculates  transformations  up  to  degree  two  in  the  series  expansion.  After 
obtaining  the  transformation,  one  applies  any  of  the  standard  control  design 
procedures  for  the  resulting  model.  In  the  program  we  have  provided 
tools  for  feedback  pole  placement  for  the  linear  part  of  the  closed  loop 
system.  In  addition,  one  also  has  the  option  to  choose  various  forms  of 
inputs  and  different  initial  conditions  for  simulating  the  resulting  system. 
Comparisons  of  the  performance  of  the  nonlinear  feedback  with  a  linear 
feedback  design  can  be  made. 

In  order  to  run  the  program,  the  folder  (or  directory)  that  contains 
the  Approximate  Linearization  Toolbox  for  Nonlinear  Control  Systems 
should  be  present  in  the  directory  or  folder  of  the  MATLAB  program  and 
toolboxes.  Either  the  MATLABPATH  has  to  be  set  appropriately  (see 
MATLAB  User's  Manual),  or  after  starting  MATLAB  the  subdirectory 
that  contains  the  M-files  of  the  program  has  to  be  the  opened  or  set  to  be 


the  current  working  directory.  The  program  is  menu-driven,  and 
executing  the  M-file  mainmenu  by  typing  its  name  will  present  a  menu 
from  which  the  system  parameters  can  be  entered,  the  linearization 
problem  can  be  solved,  and  the  results  can  be  simulated. 


5.2.  Using  the  Program 

Prior  to  running  the  program,  the  nonlinear  terms  in  the  control  system 
have  to  be  expanded  in  a  series  t )  obtain 

x  =  Fx  +/(2)( x)  +  (G  +  g  {l\x))u  +  0(x,u)3  (5.1) 

where  Fx  +  Gu  is  the  linear  part  of  the  plant,  and/(2)(jc)  +  g^l\x)u  is  the 
second  degree  part.  A  characteristic  scale  for  each  state  should  be  obtained 
and  entered  into  the  program.  For  a  complete  listing  of  the  program  code 
see  Appendix  1.  An  example  session  recorded  during  use  is  also  provided 
in  Appendix  2.  After  a  coordinate  change  and  feedback  have  been  found, 
the  user  can  perform  feedback  design  and  simulate  the  resulting  models. 
The  feedback  gains  are  calculated  in  the  program  either  by  specifying  the 
closed  loop  eigenvalues,  or  according  to  a  quadratic  optimal  regulator 
design  procedure  to  minimize  a  performance  index  C: 

oo 

C  =  min  J(jc TQx  +  uJRu)dt  (5.2) 

Either  procedure  yields  a  set  of  feedback  gains  K.  In  the  program,  a  total 
of  three  different  systems  are  simulated  together: 


1:  A  linear  model  with  linear  feedback  (LMLF) 
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x=  (F  +GK)x  +  Gr  (5.3) 

where  F  and  G  are  identical  to  those  of  the  original  nonlinear  system,  and 
the  same  gains  K  as  found  above  are  used  to  perform  pole  placement  on  the 
model.  The  term  r  is  a  reference  input  signal.  This  system  is  intended  to 
serve  as  a  benchmark  to  evaluate  the  performance  of  the  nonlinear 
feedback  synthesis  procedure.  It  is  expected  that  the  nonlinear  control  will 
drive  the  system  to  behave  approximately  like  (5.9). 

2:  The  nonlinear  system  with  a  linear  feedback  law  (NSLF): 

x  =  (F  +GK)x+f(2\x)  +  g  (l\x)Kx  +  (G  +  g  (1\x))r  (5.4) 

This  is  the  original  nonlinear  system.  After  the  standard  first  degree 
approximation,  the  same  feedback  gains  K  as  in  the  LMLF  are  used.  In 
the  simulations,  the  response  of  this  system  is  compared  to  that  of  the  the 
second  degree  approximation. 

3:  The  nonlinear  system  with  the  linear  feedback  and  the  linearizing 
quadratic  feedback  law  (NSQF): 

x  =  Fx  +fn>(x)  +  (G  +  g(1)W)(l  +  p(1)W)"'  { Kx  -  K<S/a>(x)  -  a,2,(x) 

+  r}  (5.5) 

This  is  the  nonlinear  system  on  which  a  quadratic  feedback  law  is  applied 
in  addition  to  the  linear  feedback  with  gain  K.  Note  that  in  (5.5),  the 
reference  input  r  appears  in  the  bracket.  In  other  words,  the  above  is 
equivalent  to  (in  z  coordinates) 


z  =  (F  +  GK)z  +  r 


(5.6) 


provided  there  is  no  residual  (i.e.  non-linearizable  terms)  after  the 
coordinate  change  and  feedback.  The  nonlinear  feedback  applied  to  (5.5) 
is: 

u  =  (I  +  P(1)(jc))-1  {K(x  -  <j>{2)(*))-  gc(2)(jc)  +  r }  (5.7) 

In  the  program,  one  or  more  of  the  response  curves  of  the  three  models 
may  be  plotted  and  the  performances  can  be  compared.  The  program 
allows  the  simulation  of  the  above  systems  with: 

a)  Zero  initial  conditions, 

b)  Impulse  input  with  zero  or  nonzero  initial  conditions, 

c)  Step  input  with  zero  or  nonzero  initial  conditions, 

d)  A  sinusoidal  forcing  funcion  as  the  reference  signal  with  zero  or 
nonzero  initial  conditions. 

In  the  next  section  we  will  present  simulations  of  four  different 
control  systems,  each  chosen  to  illustrate  a  particular  feature  or  aspect  of 
the  approximate  linearization  method. 


5.3.  Example  Simulations: 

In  this  chapter,  four  example  systems  will  be  simulated.  In  the  simulations 
of  Examples  i,  2,  and  3,  we  will  present  cookbook  nonlinear  systems. 
The  three  examples  are  set  up  to  possess  specific  properties.  The  first 
example  is  exactly  linearizable;  the  second  example  is  not  exactly 
linearizable  but  in  controller  canonical  form;  and  the  third  example  is  not 


exactly  linearizable  and  it  has  open  loop  poles  very  close  to  the  desired 
closed  loop  poles.  None  of  these  three  examples  represent  physical 
systems,  and  each  has  some  isolated  property  as  mentioned  above.  They 
are  intended  to  test  and  prove  the  effectiveness  of  the  nonlinear  control 
scheme  within  the  framework  of  each  of  these  special  properties. 

In  contrast,  Example  4  for  the  approximate  linearization  method  is  a 
physical  system.  The  example  considered  is  a  satellite  in  planar  earth  orbit 
specified  by  its  position  and  velocity  in  polar  coordinates.  The  nonlinear 
equations  of  motion  are  expanded  in  a  Taylor  series  at  a  nominal  earth 
orbit,  and  truncated  at  the  second  degree  term  in  the  series. 

Three  response  curves  will  be  compared  for  every  simulation:  1) 
An  ideal  linear  model,  2)  The  nonlinear  system  with  a  linear  feedback 
design,  and  3)  The  nonlinear  system  with  the  quadratic  feedback  based  on 
the  method  that  has  been  developed.  The  reason  for  including  the  linear 
model  in  the  comparison  is  to  check  the  improvement  achieved  by 
nonlinear  feedback  toward  making  the  system  respond  more  linearly.  In 
other  words,  our  evaluation  of  the  effectiveness  of  the  approximate 
linearization  will  be  based  on  how  close  the  time  response  curves  of  a 
nonlinear  system  with  quadratic  feedback  follows  the  responses  of  a  purely 
linear  system,  and  how  superior  this  improvement  is  in  comparison  with 
the  response  of  a  system  with  a  feedback  design  based  on  a  first  degree 
approximation. 

The  following  table  presents  a  list  of  the  example  simulations,  the 
various  initial  conditions  and  disturbance  inputs  applied  to  each  system,  and 
the  figures  associated  with  each  simulation. 


Example  System  InitialConditi^  Figures 
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Example  1 

jc  =  10.05  0.05  0.05V 

5.1  through  5.7 

Impulse  with  unit  magnitude 

5.8  through  5.14 

Step  with  magnitude  =  0.2 

5.15  through  5.14 

Sinusoidal  with  r(t )  =  sin(6f) 

5.19  through  5.25 

Example  2 

x  =  [0.35  0.35  0.35V 

5.36  through  5.29 

Impulse  with  unit  magnitude 

5.30  through  5.33 

Step  with  magnitude  =  0.5 

5.34  through  5.37 

Sinusoidal:  r(t )  =  sin(6t) 

5.38  through  5.44 

Example  3 

*  =  ro.i  0  OV 

5.45  through  5.48 

x  =  [-0.1  0  0V 

5.49  through  5.52 

jc  =  ro  o  o.iv 

5.53  through  5.56 

X  =  10  0  -O.IV 

5.57  through  5.60 

Sinusoidal:  r(t)  =  sin(6/) 

5.61  through  5.64 

Nearly  Circular  Satellite  (Example  4) 

jc  =  12  0  0  0V 

5.65  through  5.69 

jc  =  r-2  0  0  ov 

5.70  through  5.74 

jc  =  [0  0  1  ov 

5.75  through  5.79 

r — 

o 

1 

o 

o 

II 

H 

5.80  through  5.84 

Sinusoidal:  r(t)  =  30,000sin(6t) 

5.85  through  5.89 

5.3.1.  Example  1: 

For  the  first  example  simulation,  the  following  nonlinear  system  has  been 
used: 


Xl 
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"V 

"o" 

x\~x\x3 

~x\ 

X2 

= 
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*2 

+ 
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-Xlx2  +  x22  +  x32 

+ 
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~*3- 

_  -3 
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-1  _ 

— *3- 

_1_ 

X2X3 

-X2- 
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(5.8) 


This  system  is  exactly  linearizable,  i.e.  it  satisfies  the  conditions  of 
the  Hunt-Su  theorem.  The  first  and  second  degree  parts  of  the  system 
(5.14)  were  entered  into  the  computer  program,  and  the  linearizing 


coordinate  change  was  found  to  be: 

z\~  x\-  0.19697*!2  +  *!jc3-  0.5jc22 

(5.9a) 

z2  =  *2  +  4*j2  -  2.3939*1jc2 

(5.9b) 

z3  =*3+  7jt1*2  -  2.3939*1*3  -  1.3939*22  +^32 

(5.9c) 

Second  and  first  degree  terms  in  the  nonlinear  feedback  were: 

a(2)(.x)  =  -1.4091*j2+  1xxx2- 2jCjJc3+  7.1061jc22- 0.18182x^3  -jc32  (5.10) 

P(1)(*)  =  -2.3939*!  + *2+  2x3  (5.11) 


The  above  coordinate  change  and  feedback  transform  the  system  (5.14) 
into: 


(5.12) 


with  the  input  v  defined  as 


v  =  a(2)(x)  +  (i  +  (3(1\jc))u. 


(5.13) 
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A  closed  loop  feedback  was  designed  by  the  quadratic  optimal  regulator 
procedure  with  Q  =  /  and  R  =  I.  The  feedback  gains  were  found  as: 

K  =  [  0.1623  6.9158  2.9789  ] 

which  resulted  in  the  closed-loop  eigenvalues: 

Xh2  =  -0.7245  ±0.85 15/ 

=  -2.5299. 

For  the  following  graphs,  the  curves  are: 

LMLF  ( - ):  Reference  Linear  Model  with  Linear  Feedback. 

NSLF  ( - ):  Nonlinear  System  with  the  Linear  Feedback  design. 

NSQF  ( - ):  Nonlinear  System  with  Quadratic  Feedback  design. 

Figures  5.1,  5.2,  and  5.3  show  the  response  curves  of  the  above  three 
systems  for  states  xvx2,  and  x3  respectively.  Initial  conditions  are 
xx  =  0.05;  x2  =  0.05;  x3  =  0.05  and  there  is  no  forcing  input.  The  solid 
curve  of  the  LMLF  behaves  as  expected  from  a  linear  model.  The  curve  of 
the  NSLF  goes  unstable,  while  the  NSQF  tracks  LMLF  quite  successfully. 
The  NSQF  rapidly  approaches  the  LMLF  at  steady  state.  This  is  not 
surprising,  since  in  z  coordinates  the  system  is  exactly  linear.  We  note  a 
somewhat  larger  overshoot  in  the  transient  of  the  NSQF,  especially 
apparent  in  the  curve  of  x3  in  Fig.  5.3.  This  is  a  tendency  of  the  NSQF  to 

become  unstable  in  this  neighborhood.  When  the  magnitude  of  the  initial 


conditions  were  further  increased,  the  response  of  the  NSQF  went  unstable 
(not  plotted). 

In  Fig.  5.4,  the  magnitude  of  the  control  effort  used  for  driving  each 
system  is  shown.  The  quantity  is  equal  to  the  linear  feedback  for  LMLF 
and  NSLF,  and  for  the  NSQF  it  includes  the  quadratic  terms  as  well.  This 
graph  offers  some  good  insight  in  both  the  transient  and  the  steady  state 
response  of  the  NSQF.  An  unstable  behavior,  if  any,  is  more  readily 
apparent  in  the  graph  of  the  control  effort.  In  this  first  simulation,  we 
notice  that  the  magnitude  of  this  input  goes  to  zero  at  steady  state.  In  other 
words,  when  the  disagreement  between  the  linear  model  and  the  nonlinear 
system  is  small,  the  magnitude  of  the  input  necessary  to  drive  the  system  is 
likewise  small. 
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LMLF  (  -  ) _ NSLF  ( - ) _ NSQF  ( 

Fig.  5.1.  Free  response  of  state  xl  of  Example  1  with  nonzero  initial  conditions. 


Magnitude  of  die  control  effort  ul 


LMLF  (  -  ) _ NSLF  ( - ) _ NSQF  ( - 

Fig.  5.4.  Magnitudes  of  the  inputs  for  Example  1  with  nonzero  initial  conditions. 
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Figures  5.5,  5.6,  and  5.7  show  the  phase  poi.rait  plots  of  the  same 
simulation.  We  note  that  the  plots  shown  are  projections  of  the  true  three- 
dimensional  phase  portrait  onto  the  respective  planes  shown.  All  models 
start  at  the  same  initial  condition.  While  LMLF  and  NSQF  decay  towards 
the  origin  rapidly,  the  NSLF  moves  away  from  the  origin,  i.e.  it  clearly 
shows  the  tendency  of  instability. 


Fig.  5.7.  Phase  portrait  of  x2  versus  x3  for  Example  1  with  nonzero  initial  conditions. 


Figures  5.8  through  5.14  show  the  response  of  the  system  to  an 
impulse  input.  All  initial  conditions  have  been  set  equal  to  zero.  In  this 
case,  the  nonlinear  control  causes  the  system  to  reach  instability  very 
rapidly.  Note  that  the  instability  is  more  evident  in  the  plot  of  control 
feedbacks.  Even  though  the  NSQF  would  achieve  good  tracking  in  the 
neighborhood  of  the  nominal  point,  as  seen  in  the  earlier  Figs.  5.1  through 
5.7  for  the  simulation  with  nonzero  initial  conditions  and  no  forcing,  it  has 
a  tendency  to  decrease  stability  bounds  for  some  systems.  Stability 
properties  of  a  nonlinear  system  are  very  difficult  to  analyze,  and  there  is 
no  rigorous  theory  that  explains  the  stability  behavior. 


Fig.  5.10.  Response  of  the  state  x3  of  Example  1  to  an  impulse  input. 


Fig.  5.11  for  the  magnitude  of  the  linearizing  control  clearly  shows 
the  instability  of  the  NSQF  in  this  case.  A  possible  cause  of  this  behavior 
might  be  a  loss  of  rank  of  the  term  (i  +  P(1)(*))  (a  scalar  in  this  example) 
which  is  inverted  during  calculating  the  nonlinear  feedback  (see  Eqn.  5.5). 
Note  that,  as  seen  in  Eqn.  5.13,  this  corresponds  to  a  loss  of  controllability 
for  the  nonlinear  system.  The  phase  portrait  plots  for  the  impulse  response 
are  shown  in  Figs.  5.12  through  5.14. 


Figures  5.15  through  5.18  show  the  response  of  the  system  to  a  step 
input  of  magnitude  0.2.  All  initial  conditions  have  been  set  to  zero.  In  this 
case,  the  system  NSLF  reaches  instability  very  rapidly.  The  NSQF  is  not 
unstable  (even  though  for  step  inputs  of  higher  amplitude,  it  would 
eventually  exhibit  unstable  behavior),  but  it  displays  a  constant  steady  state 
error  in  tracking  the  reference  step  input.  Since  the  phase  portrait  plots 
don't  offer  much  insight  in  this  case,  they  were  not  plotted. 


LMLF  (  -  -  ) _ NSLF  ( - ) _ NSQF  ( - ) 

Fig.  5.15.  Response  of  the  state  Xj  of  Example  1  to  a  step  input 


LMLF  ( - ) _ NSLF  ( - ) _ NSQF  ( - )| 

Fig.  5.18.  Magnitudes  of  the  control  inputs  for  Example  1  for  a  step  input. 

The  graphs  shown  in  Figs.  5.19  through  5.25  present  the  response  of 
the  system  to  a  sinusoidal  input  u(t)  =  Asin(cor)  with  the  parameters  to  =  6, 
A  -  1.  This  simulation  is  probably  the  most  interesting  case  in  displaying 
the  advantage  of  the  nonlinear  feedback.  The  NSLF  has  a  constant  offset 
away  from  the  equilibrium  point.  For  NSQF,  this  average  error  rapidly 
goes  to  zero  since  in  z  coordinates,  the  system  is  exactly  linear.  After  the 
initial  transients  die  out,  the  LMLF  and  the  NSQF  oscillate  around  an 
equilibrium  at  the  origin,  i.e.  their  average  is  zero. 

We  should  emphasize  that  the  steady-state  equilibrium  around  which 
each  model  oscillates  is  directly  related  to  the  average  of  the  nonlinear 
terms  in  the  vector  fields  over  each  period  of  the  sinusoidal  input.  The 
strongest  case  we  can  make  in  favor  of  the  NSQF  is  that  this  average  is 
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extremely  close  to  zero,  i.e.  it  is  like  that  of  an  exactly  linear  model.  Figs. 
5.19,  5.20,  and  5.21  show  the  transient  and  steady  state  time  responses  of 
all  three  models.  The  above  argument  is  clearly  obvious  in  these  figures, 
especially  in  Fig.  5.19  since  the  state  x3  is  farthest  away  from  the  input  in 

terms  of  integration  (note  the  Controller  form  of  the  linear  part  of  the 
system  in  Eqn.  5.8). 

One  should  also  note  that  the  very  impressive  improvement  achieved 
by  the  quadratic  control  toward  causing  the  system  to  track  the  linear 
model,  as  seen  in  Fig.  5.19,  is  due  to  the  fact  that  this  example  is  exactly 
linearizable.  A  nonlinear  feedback  and  coordinate  change  that  achieves 
exact  linearization  could  as  well  be  calculated  using  the  Hunt-Su 
linearization  method.  However,  in  our  approach,  we  also  minimized  the 
length  of  the  vector  formed  by  the  coefficients  of  <f>(2\  a(2),  and  p(1).  In 
contrast,  the  Hunt-Su  theorem  will  not,  in  general,  yield  the  minimum 
solution  (in  the  sense  that  we  have  defined  in  Ch.  3)  for  the  coordinate 
change  and  feedback. 

For  n  =  3  the  Hunt-Su  method  yields  a  one-parameter  family  of 
solutions  to  the  approximate  linearization  problem  for  systems  that  are 
exactly  linearizable  up  to  degree  2,  and  the  choice  for  the  free  parameter 
remains  to  be  determined.  The  correct  choice  for  this  parameter  in  order 
to  obtain  the  “smallest”  coordinate  change  and  feedback  is  precisely  the 
solution  found  by  our  method.  Therefore,  the  specific  nonlinear 
coordinate  change-nonlinear  feedback  pair  calculated  for  this  example  is 
expected  to  yield  a  better  performance  during  the  transient  response.  In 
the  steady  state,  the  difference  in  performance  between  the  coo/dinate 
change  and  feedback  we  have  found  and  other  solutions  may  not  be 


appreciable  if  the  trajectories  are  too  close  to  the  origin.  The  nature  of  our 
approach  allows  us  to  immediately  extend  the  method  to  systems  that  are 
not  exactly  linearizable  (see  examples  2,  3,  and  4). 

Fig.  5.22  shows  the  plot  of  the  feedback  terms,  where  the 
feedback  for  NSQF  has  an  offset  from  the  equilibrium.  This  clearly 
indicates  that  the  nonlinear  input  drives  the  system  to  a  zero  average  by 
introducing  a  bias  into  the  system.  In  Figs.  5.23,  5.24,  and  5.25  we  present 
the  phase  portrait  plots  of  the  three  models.  The  steady  state  equilibrium 
points  are  again  clearly  seen,  especially  in  the  graph  of  Jtj  vs  x2  where  both 

the  LMLF  and  the  NSQF  oscillate  around  the  origin.  The  NSLF  settles 
around  a  non-zero  equilibrium. 


Fig.  5.19.  Response  of  the  state  jq  of  Example  1  to  a  sinusoidal  input. 
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5.3.2.  Example  2: 

As  a  second  example  we  chose  a  nonlinear  control  system  that  is  not  exactly 
linearizable: 
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The  following  coordinate  transformation  and  feedback  pair  were  found: 


Zj  =  xl  -  0.47665x12+  0.11111*!X2  +  0.28221jc22-  0.16667*!*3 

-  0.61997*2*3  +  0.30999*32  (5.15a) 

z2~xi~  0.95331^2  +  0.055556*22  +  0.11 11  +  0.34219*2*3 

+  0.32447*32  (5.15b) 

z3  =  *3  +  x12-jc1x2  +  0.15781*22  +  0.046694*  j*3  -  0.02897  1*2*3 

+  0.48228*/  (5.15c) 

The  terms  in  the  nonlinear  feedback  were: 

a(2)(jc)  =  XjXj  +  0.30675*2*3  -  0.084526*/  (5.16) 

P(1)(jc)  =  -0.89775*!  +  0.36997*2+  0.96336*3  (5.17) 


The  above  coordinate  change  and  feedback  found  will  exactly  transform 
the  following  linearizable  system  (rather  than  (5.14))  into  z  coordinates: 
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L  0.05555x1+0.08333jc2+0.02777x3  J 

-0.05555jc22-0.05555x2x3+0.9444jc32 

jc12+jc1x2+0.05555x22+0.05555x2jc3+0.05555x3" 

-0.05555x22-0.05555x2X3-0.05555a:32 


(5.18) 


i.e,  (5.18)  will  transform  with  (5.15),  (5.16)  and  (5.17)  into  z  coordinates 
as 


z  =  Fz  +  Gv  (5.19) 

with  the  input  defined  with 

v  =  a(2)(jc)  +  (I  +  p(1)0c))w.  (5.20) 

The  next  step  in  the  computation  process  was  designing  a  closed  loop 
feedback  for  the  linear  part  of  the  plant.  The  feedgack  gains  were 
calculated  by  the  program  with  a  quadratic  optimal  regulator  design 
procedure.  The  matrix  Q  was  taken  to  be  the  identity,  and  R  was  set  to 
unity.  The  feedback  gains  were  thus  found  to  be: 


AT  =  [1  1.7321  1] 


(5.21) 


and  these  gains  placed  the  eigenvalues  of  the  linear  part  of  the  closed  loop 
system  at: 
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Xl2  =  -0.866  ±  0.5/ 

X,3  =  -1. 

Since  we  designed  the  closed  loop  feedback  gains  for  the  linearized  system 
in  z  coordinates,  we  obtained  the  equivalent  feedback  u  in  x  cordinates  by 
using  (5.1).  This  feedback  was  calculated  as: 

u  =  (I  +  |3(1)(jc))-1  {K(x  ~  <j)(2)(;t»  -  a(2)(*)  +  r }  (5.7) 

For  the  following  graphs,  similar  to  the  first  simulation,  the  curves  are: 

LMLF  ( - ):  Reference  Linear  Model  with  Linear  Feedback. 

NSLF  ( - ):  Nonlinear  System  with  the  Linear  Feedback  design. 

NSQF  ( . ):  Nonlinear  System  with  Quadratic  Feedback  design. 

The  plots  in  Figs.  5.26  through  5.29  show  the  response  of  the  models  to  an 
initial  condition  of  xl  =  0.35;  x2  =  0.35;  x3  -  0.35  and  a  zero  reference 
input.  In  Figs.  5.26,  5.27,  and  5.28  the  time  responses  of  the  states  jcj,  x2, 
and  jc3  are  plotted,  respectively.  The  NSQF  shows  some  tendency  toward 
instability  until  t  =  1  (a  simulation  for  slightly  increased  values  for  the 
initial  conditions,  not  plotted,  displayed  a  singularity  in  this  neighborhood). 
After  the  NSQF  recovers  from  this  region,  it  tracks  the  LMLF  extremely 
closely.  The  NSLF  curve,  on  the  other  hand,  settles  down  to  the  same 
value  much  later,  and  displays  some  amount  of  overshoot. 
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The  improvement  achieved  by  the  NSQF  in  responding  like  a  linear 
system,  especially  as  seen  in  Fig.  5.26,  makes  a  very  strong  case  for  our 
method.  Despite  the  fact  that  the  example  is  not  an  exactly  linearizable 
system,  there  is  almost  an  order  of  magnitude  difference  between  the 
NSQF  and  the  NSLF  in  percent  deviation  from  the  ideal  linear  model.  The 
time  that  the  NSQF  settles  at  zero  equilibrium  is  also  very  close  to  that  of 
the  LMLF,  whereas  the  NSLF  displays  transient  behavior  for  a  much 
longer  period  of  time. 

The  curves  of  the  inputs  of  Fig.  5.29  shows  the  steep  increase  in  the 
value  of  the  control  effort  between  t  =  0.5  and  t  -  1,  the  region  in  which 
the  NSQF  has  a  tendency  of  instability.  The  inputs  settle  to  zero 
afterwards. 


LMLF  (  .  ) _ NSLF  ( - ) _ NSQF  ( -  - . ) 

Fig.  5.26.  Free  response  of  state  of  Example  2  with  nonzero  initial  conditions. 
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Fig.  5.29.  Magnitudes  of  the  inputs  for  Example  2  with  nonzero  initial  conditions. 


Figures  5.30  through  5.32  show  the  time  response  of  the  three 
models  to  an  impulse  input  of  unit  magnitude.  The  NSQF  displays  unstable 
behavior,  and  its  curve  is  not  plotted  through  the  entire  time  of  the 
simulation  because,  due  to  instability,  the  integration  algorithm  could  not 
carry  out  the  computation  any  further.  Since  the  stability  properties  of 
nonlinear  systems  are  not  well  known,  the  cause  of  this  instability  is  not 
entirely  clear.  Fig.  5.33  shows  the  magnitude  of  the  inputs. 


In  Figs.  5.34  through  5.36  are  plotted  the  time  responses  of  the  same 
models  to  a  step  input  of  amplitude  0.5.  In  these  simulations,  the  NSQF 
exhibits  stable  behavior,  and  settles  toward  the  optimally  damped  response 
curve  of  the  LMLF  within  5  time  units.  On  the  contrary,  the  NSLF 
quickly  goes  unstable  and  can  not  track  the  reference  signal  at  all.  One  can 
choose  a  sufficiently  small  amplitude  for  the  step  input  in  order  to  obtain 
stable  behavior  for  the  NSLF,  but  the  characteristics  of  the  response  curve 
in  comparison  with  the  NSQF  would  still  display  larger  errors  in  tracking 
the  LMLF.  Fig.  5.37  shows  the  magnitude  of  the  inputs  in  this  case.  The 
input  values  (except  for  NSLF,  which  goes  unstable)  settle  at  a  steady  state 
nonzero  constant  after  some  transient.  This  is  expected  because  the 
reference  signal  is  a  step  input. 


Fig.  5.34.  Response  of  the  state  jq  of  Example  2  to  a  step  input. 


Fig.  5.37.  Magnitudes  of  the  control  inputs  for  Example  2  for  a  step  input. 


Figs.  5.38,  5.39,  and  5.40  show  the  transient  and  steady  state  time 
domain  response  of  all  three  models  of  Example  2  to  a  sinusoidal  input. 
The  reference  signal  was  r{t )  =  As'm((£>t  )  with  co  =  6,  A  =1.  Similar 
comments  as  in  Example  1  can  be  made  for  this  set  of  responses.  The 
NSQF  clearly  has  an  advantage  over  the  NSLF  in  tracking  the  sinusoidal 
reference  input.  In  this  simulation,  since  the  linearization  is  not  exact,  the 
NSQF  does  not  follow  the  LMLF  exactly.  However,  in  comparison  with 
the  curve  of  NSLF  this  deviation  is  negligible.  This  simulation  is  again  the 
most  interesting  case  in  displaying  the  advantage  of  the  nonlinear  feedback. 
The  NSLF  has  a  constant  average  offset  from  the  equilibrium  point.  For 
NSQF,  this  offset  is  much  smaller  since  in  z  coordinates,  the  system  is 
closer  to  a  linear  model. 
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The  above  example  verifies  our  claim  in  the  approximate 
linearization  procedure  of  minimizing  the  “distance”  between  the  given 
system  and  the  “closest  linearizable”  system.  Of  course  the  improvement 
achieved  by  the  quadratic  feedback  is  dependent  on  how  far  away  the 
original  nonlinear  system  is  from  the  “closest  linearizable”  system  before 
the  linearization  is  done.  After  the  initial  transients  die  out,  the  LMLF 
settles  into  an  equilibrium  at  the  origin,  i.e.  its  average  is  zero  as  expected 
from  an  exactly  linear  system.  The  NSQF  displays  oscillattions  extremely 
close  to  the  LMLF.  Again  we  emphasize  that  the  steady-state  equilibrium 
around  which  each  model  settles  is  directly  related  to  the  average  of  the 
nonlinear  terms  in  the  vector  fields  over  each  period  of  the  sinusoidal 
input.  In  the  case  of  NSQF  this  average  is  very  small,  i.e.  it  is  closer  to  an 
exactly  linear  model.  The  argument  is  more  obvious  in  Fig.  5.38  since  the 
state  jcj  is  farthest  away  from  the  input  in  terms  of  the  number  of 

integrations. 

Fig.  5.41  shows  the  plot  of  the  feedback  inputs,  each  of  which  is 
periodic  as  expected.  It  is  very  interesting  to  observe  that  while  the 
average  values  of  the  control  inputs  for  LMLF  and  NSLF  are  zero  (or 
almost  zero),  the  average  of  the  input  for  NSQF  deviates  from  zero.  This 
fact  clearly  has  a  connection  to  the  above-mentioned  argument  for  the 
averages  of  the  responses  for  the  three  systems.  It  appears  that  the 
quadratic  feedback  introduces  a  bias  into  the  nonlinear  system  which  drives 
it  towards  the  origin  (on  the  average). 

In  Figs.  5.42,  5.43,  and  5.44  we  present  the  phase  port  ait  plots  of 

the  three  models.  The  steady  state  equilibrium  points  are  again  clearly 
seen,  especially  in  the  graphs  of  jc}  versus  x 2  and  X\  versus  *3. 


Fig.  5.42.  Phase  portrait  of  xx  versus  jc2  for  Example  2  for  a  sinusoidal  input. 


Fig.  5.44.  Phase  portrait  of  jc2  versus  x3  for  Example  2  for  a  sinusoidal  input. 


5.4.3.  Example  3: 

The  following  third  order  system  has  been  simulated  as  another  example. 
This  system  is  not  exactly  linearizable  and  it  has  open  loop  poles  very  close 
to  the  desired  closed  loop  pole  locations. 
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-4.1128  _ 

-*3- 
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The  characteristic  scales  of  each  state  and  the  input  were  assumed  to  be 
equal  to  unity.  The  following  coordinate  transformation  and  feedback  pair 
were  found: 

zx  =  xx  +  0.86986jc12+  0.62932x^2-  2.8055x^3  -  0.02927x22 

+  0.21302x2X3  -  0.0303 89x32  (5.23a) 

z2  =  x2  -  1.5358xj2  +  2.2273x1x2  +  1.0036XJX3  +  0.36424x22  -  3. 2291 

+  1.7879x32  (5.23b) 

z3=x3  +  1.643xj2-  0.062292x^2  -  1.9677xjX3  +  0.26052x22 

-  0.25082x2X3  +  0.014877x32  (5.23c) 

The  terms  in  the  nonlinear  feedback  were: 

a(2)(x)  =  7.3167x,2  -  0.0043799x1x2  -  14.095XJX3+1.5513X22  -  4.5535x2X3 
+  13.072x32  (5.24) 

P(1)(x)  =  1.1933xj  +  0.72725x2-  0.43993x3  (5.25) 

The  closest  linearizable  system  had  the  following  nonlinear  terms: 

fXx)  = 

"l.0001x12+2.0002x1x2+1.0001x1x3+0.00044583x22+1.0002x2x3+1.0001x32' 

X  j  2+2x  jX2+  3x  jX3+x22+X2X3+x32 


0.9998  1xjX2  +X!X3+1 .9996x22+0.99981x2x:3 
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1 . 0004.x  j  +0.001 984 12x2  +  2.0003x3 


g(1'W  = 


0.99986;e2  +  jc3 


L-0.000391772X!-  0.0017123a:2+  1.9997jc3J 


The  system  has  open  loop  poles  at  -0.5227;  -2.6479  ±  0.3841j.  A  closed 
loop  feedback  for  the  linear  part  of  the  plant  was  calculated  using  a 
quadratic  optimal  regulator  design  procedure  with  the  weighing  on  the 
states  and  the  input  equal  to  unity.  The  feedback  gains  were  found  as: 
AT  =  [-0.4730  0.0177  1.4224] 

and  the  above  gains  placed  the  eigenvalues  of  the  linear  part  of  the  closed 
loop  system  at: 

A,!  =  -0.5056 

=  -1.7647 

=  -2.5633 


Since  the  open  loop  system  already  has  stable  roots,  the  linear  feedback 
gains  that  are  necessary  to  drive  the  system  are  small.  In  the  simulation 
plots  the  response  curves  are  defined  with  the  following  legend  as  before: 

LMLF  ( - ):  Reference  Linear  Model  with  Linear  Feedback. 

NSLF  ( - -  •  ):  Nonlinear  System  with  the  Linear  Feedback  design. 

NSQF  ( . ):  Nonlinear  System  with  Quadratic  Feedback  design. 

Figures  5.45  through  5.48  show  the  response  of  the  system  to  an  initial 
condition  of  x  =  [0.1  0  0]'  (referred  to  as  initial  condition  #1  in  the  figures) 
and  the  feedback  input.  The  reference  signal  is  set  to  zero.  The  response 


curves  of  Figs.  5.45,  5.46,  and  5.47  show  the  vastly  superior  behavior  of 
tne  NSQF  in  terms  of  responding  like  a  linear  system,  i.e.  in  tracking  the 
LMLF  extremely  closely  immediately  after  the  transient  response  dies  out. 

We  should  note  that  in  this  specific  example,  the  behavior  of  the 
NSQF  in  tracking  the  LMLF  should  be  considered  even  more  impressive 
because  the  system  is  not  in  controller  canonical  form.  When  a  system  is  in 
controller  form,  the  tracking  errors  appear  to  be  much  larger  in  those 
states  that  are  further  away  from  the  input  in  terms  of  integration.  This  is 
because  (considering  a  single  input  system)  the  signals  are  differentiated 
n  - 1  times  until  the  first  state  is  reached,  and  all  the  errors  are  therefore 
amplified  (Example  1  is  a  good  demonstration  of  this  fact).  In  this 
example,  we  observe  excellent  tracking  behavior  for  NSQF  on  all  the 
states,  even  without  the  benefit  of  the  presence  of  controller  form. 

An  inspection  of  the  response  characteristics  of  jc,  jc2  and  x3  in  Figs. 
5.45,  5.46,  and  5.47,  respectively,  shows  that  in  terms  of  settling  times  at 
the  zero  steady-state  value,  the  NSLF  is  much  superior  to  both  the  linear 
ideal  model  LMLF,  and  the  NSQF.  However,  this  is  a  fortuitous  behavior 
of  the  system  due  to  the  specific  nonlinearities  in  this  example.  The  goal  of 
the  nonlinear  control  design  is  to  track  the  linear  system  as  close  as 
possible,  rather  than  to  achieve  the  fastest  response  or  the  shortest  settling 
time.  One  should  also  remember  that  the  transient  and  the  steady  state 
responses  of  a  linear  system  can  be  arbitrarily  adjusted  if  the  system  is 
controllable  and  if  there  is  no  restriction  in  the  magnitude  of  the  requierd 
control  effort.  Therefore  this  should  be  seen  as  an  isolated  phenomenon 
which  results  from  the  particular  nonlinearities  present,  as  well  as  from  the 
choice  of  eigenvalues  of  the  linear  part  of  the  model. 


During  the  transient  part  of  the  response  the  NSQF  is  not  close  to 
LMLF  because  the  nonlinear  terms  are  dominant.  This  is  especially 
apparent  in  the  curves  of  x2  and  jc3  (Figs.  5.46  and  5.47).  After  the  first 

half  of  the  simulation,  this  difference  becomes  virtually  indistinguishable. 

In  the  magnitude  plot  of  the  inputs  (Fig.  5.48),  the  control  effort  for 
LMLF  and  NSLF  are  extremely  small  (almost  zero  for  this  simulation)  and 
not  visible  on  the  scale  of  the  plot.  This  is  because  the  open  loop  system 
poles  are  very  close  to  the  closed  loop  values,  and  the  control  effort  needed 
by  a  linear  controller  for  driving  the  system  to  the  origin  is  very  small  in 
both  cases.  In  contrast,  the  NSQF  needs  some  additional  control  effort  to 
linearize  the  system.  This  magnitude  approaches  zero  as  the  states 
approach  the  origin. 


LMLF  (  ~  -  ) _ NSLF  ( - ) _ NSQF  ( -  - . ) 

Fig.  5.45.  Free  response  of  state  JC]  of  Example  3  with  nonzero  initial  condition  #1. 
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Fig.  5.48.  Magnitudes  of  the  inputs  for  Example  3  with  nonzero  initial  condition  #1. 


In  the  following  simulation,  the  initial  condition  of  the  previous  case 
has  been  reversed  in  sign  as  x  =  [-0.1  0  0]’  (referred  to  as  initial  condition 
#2  in  the  captions  of  the  graphs).  The  reason  for  choosing  this  set  of  initial 
conditions  is  to  test  the  behavior  of  the  nonlinear  system  for  sensitivity  to 
initial  conditions.  More  specifically,  we  would  like  to  check  whether  the 
NSQF  will  exhibit  the  same  excellent  performance  for  a  variety  of  initial 
conditions  as  was  seen  in  the  simulation  for  the  initial  condition  #1.  The 
responses  and  the  control  efforts  are  shown  in  Figs.  5.49  through  5.52. 

The  NSLF  is  indeed  initial  condition  sensitive  (see  the  large 
deviations  in  Figs.  5.49,  5.50,  5.51  from  the  LMLF  that  it  is  supposed  to 
track  using  the  feedback  law  derived  from  the  first  order  approximation). 
This  is  expected  since  the  NSLF  does  not  have  the  benefit  of  our  nonlinear 
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control  scheme.  The  same  plots  show  that  the  NSQF  benefits  greatly  from 
the  nonlinear  control.  Indeed,  comparing  the  response  curves  of  initial 
condition  #1,  the  NSQF  control  displays  an  even  more  impressive  response 
in  tracking  the  LMLF,  as  seen  from  Figs.  5.49,  5.50,  and  5.51  of  the  three 
states.  To  emphasize  once  more,  the  point  here  is  not  that  the  NSQF 
follows  the  LMLF  better  for  this  set  of  initial  conditions  over  some  others, 
but  that  it  is  consistent  in  this  behavior  for  a  variety  of  initial  conditions. 

The  input  effort  is  again  very  small  for  the  LMLF  and  the  NSLF 
compared  to  the  NSQF.  This  is  not  regarded  as  a  disadvantage  for  the 
quadratic  control.  The  reason  the  NSQF  seems  to  require  large  amounts  of 
control  effort  to  linearize  the  system  is  that  the  open  loop  eigenvalues  of 
the  linear  part  of  the  system  are  already  near  the  desired  closed  loop 
locations  and  the  linear  feedback  gains  are  therefore  very  small. 


LMLF  ( —  —  ) _ NSLF  ( - ) _ NSQF(-~  - - ) 

Fig.  5.49.  Free  response  of  state  jc,  of  Example  3  with  nonzero  initial  condition  #2. 


Fig.  5.52.  Magnitudes  of  the  inputs  for  Example  3  with  nonzero  initial  condition  #2. 


We  continue  to  further  test  the  sensitivity  of  the  system  of  Example  3 
by  applying  nonzero  initial  conditions  on  other  states.  In  the  following,  we 
present  another  set  of  simulations  for  the  same  system  with  nonzero  initial 
conditions  for  xy  The  initial  condition  values  are  x  =  [0  0  0.1]  (initial 

condition  #3)  and  x  =  [0  0  -0.1]  (initial  condition  #4).  Response  curves 
and  the  control  efforts  for  initial  condition  #3  are  shown  in  Figs.  5.53 
through  5.56.  Figs.  5.57  through  5.60  present  the  responses  and  the  input 
effort  for  initial  condition  #4. 

Comparing  the  responses  of  state  x{  in  Figs.  5.53  and  5.57  for  initial 
conditions  #3  and  #4,  we  can  state  the  same  conclusions  that  were  presented 
for  initial  conditions  #1  and  #2.  The  NSQF  tracks  the  ideal  linear  model 
LMLF  very  closely  as  seen  in  both  of  these  figures.  Comparison  of  these 
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two  figures  show  that  the  sensitivity  to  changes  in  initial  conditions  of  the 
NSQF  is  minimal.  The  tracking  performance  by  the  NSQF  of  the  LMLF  is 
excellent  in  both  cases.  Similar  conclusions  are  reached  when  the  responses 
for  the  set  of  initial  conditions  #3  and  #4  of  the  state  x2  in  Figs.  5.54  and 
5.58,  and  of  the  state  x3  in  Figs.  5.55  and  5.59  are  compared. 

The  control  efforts  seen  in  Figs.  5.55  and  5.60  show  that  the  NSQF 
requires  larger  input  values  compared  to  a  linear  design.  As  explained 
earlier,  the  reason  that  the  LMLF  and  NSLF  require  such  small  input 
efforts  is  that  the  open  loop  and  the  closed  loop  eigenvalues  (of  the  linear 
part  of  the  example)  are  very  close  to  each  other. 


LMLF  (  ~  ) _ NSLF  ( - ) _ NSQF  ( . ) 

Fig.  5.53.  Free  response  of  state  jc,  of  Example  3  with  nonzero  initial  condition  #3. 


Fig.  5.60.  Magnitudes  of  the  inputs  for  Example  3  with  nonzero  initial  condition  #4. 


As  the  final  simulation  of  Example  3,  we  present  the  response  of  the 
system  to  a  sinusoidal  reference  input  of  r  =  sin(6/).  As  in  the  Examples  1 
and  2  presented  earlier,  the  average  value  of  the  sinusoidal  response  gets 
closer  to  zero  for  the  NSQF.  This  improvement  is  more  apparent  in  the 
graphs  of  xx  and  x3  in  Figs.  5.61  and  5.63.  For  the  response  xx  of  the 

NSLF  in  Fig.  5.61,  one  can  observe  substantial  nonzero  average  as  well  as 
a  distorted  non-sinusoidal  shape  in  the  response  curve.  The  NSLF  also 
displays  a  phase  deviation  away  from  the  linear  behavior,  which  is  more 
apparent  in  the  response  of  x2  in  Fig.  5.62.  These  three  characteristics 

(namely  nonzero  average,  non-sinusoidal  response,  and  a  phase  shift  away 
from  the  linear  behavior)  are  typical  characteristics  of  a  nonlinear  system, 
and  they  have  been  improved  by  the  quadratic  feedback  of  the  NSQF; 


In  Fig.  5.63,  the  input  for  the  NSQF  is  periodic  and  strongly 
nonlinear.  The  nonlinear  shape  of  the  input  signal  is  not  arbitrary;  this 
specific  nonlinearity  in  the  input  effectively  cancels  out  the  nonlinearities  in 
the  system  (up  to  degree  two).  The  magnitudes  of  the  control  inputs  for 
LMLF  and  NSLF  are  too  small  to  be  seen  in  the  given  scale.  Again  this  is 
because  the  linear  feedback  gains  were  small  since  the  open  loop  poles  were 
close  to  the  closed  loop  values. 


Fig.  5.61.  Response  of  the  state  Xj  of  Example  3  to  a  sinusoidal  input. 
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Fig.  5.64.  Magnitude  of  the  control  efforts  for  Example  3  for  a  sinusoidal  input 


5.3.4.  Nearly  Circular  Satellite-Example  4: 

As  it  was  mentioned  in  the  beginning  of  this  section,  in  the  simulations  of 
Examples  1,  2,  and  3,  we  have  presented  cookbook  nonlinear  systems.  The 
first  example  was  exactly  linearizable;  the  second  example  was  not  exactly 
linearizable  but  in  controller  canonical  form;  and  the  third  example  had 
open  loop  poles  very  close  to  the  desired  closed  loop  poles. 

The  fourth  example  for  the  approximate  linearization  method  that 
we  present  now,  is  a  physical  system.  The  following  problem  has  been 
adopted  from  Kailath  [26].  We  consider  a  satellite  of  mass  m  in  planar 
earth  orbit  specified  by  its  position  and  velocity  in  polar  coordinates.-as 
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x  —  [r  r  0  0  ]'  (5.26) 

where  r  is  the  orbital  radius  and  0  is  the  angle  of  the  radius  vector  with 
respect  to  a  stationary,  inertial  reference  line.  The  input  thrusts  or  forces 
are  written  as 

u  =  [ur  Uq  ]’  (5.27) 

which  are  inputs  along  the  radial  and  tangential  directions.  These  are 
applied  by  using  small  rocket  engines.  The  equations  of  motion  are: 

(5.28) 


where  m  is  the  satellite  mass  and  k  is  the  orbital  constant.  When  the 
equations  of  motion  are  linearized  around  a  nominal  orbit,  the  system  is 
locally  controllable  with  the  above  inputs.  If  either  one  of  the  controls  is 
assumed  to  be  lost,  the  linearized  system  is  still  controllable  from  only  ue 
but  not  from  only  ur. 

In  the  following  example  we  assume  that  ur=  0  and  we  control  the 
satellite  with  m0  only.  The  point  of  this  assumption  is  that  when  both 
controls  are  present,  all  the  nonlinear  terms  in  the  system  can  be  cancelled 
exactly  by  appropriate  feedback,  and  a  nonlinear  coordinate  change 
becomes  unnecessary.  Furthermore,  when  both  of  the  inputs  are  present, 


x  = 


rO2 


k  ur 

-7+  — 
rL  m 


e 


-2r0 


2Sl 


133 


the  system  is  exactly  linearizable  up  to  any  desired  degree.  Thus  the 
problem  of  feedback  linearization  using  both  of  the  inputs  becomes  an 
uninteresting  exercise  in  algebra.  With  tangential  thrust  alone,  however, 
the  system  becomes  more  challenging  to  control,  and  a  nonlinear 
coordinate  transformation  is  needed  in  addition  to  nonlinear  feedback. 
With  this  assumption  we  present  a  stronger  demonstration  of  the 
effectiveness  of  the  approximate  linearization  method. 

In  the  following,  the  system  equations  will  be  rewritten  around  the 
nominal  states 

•*n„m  =  ('0  0  “O'  “ol 

and  the  state  vector  x  is  redefined  as  perturbations  from  this  nominal  orbit: 

x  =  [  hr  5 r  0-coo/  8(0  ]’  (5.29) 

Note  that  the  notation  in  (5.29)  has  been  changed  from  that  of  (5.28);  the 
state  vector  x  now  represents  the  vector  of  perturbations. 

When  perturbed  around  the  nominal  orbit,  the  Taylor  series 
expansion  of  the  system  equations  including  the  second  degree  terms  (i.e. 
truncated  at  the  third  degree  in  the  series  expansion)  becomes 
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with  the  numerical  values  assigned  as  : 
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(5.30) 


co0  =  n/1  rad/hr;  r0  =  12.74  Mm  (Mm  =  106  meters);  m  =  1000  kg. 

These  parameter  values  correspond  to  an  earth  satellite  with  a  period  of  4 
hours  and  an  orbit  radius  of  2  earth  radii.  The  set  of  values  represent  an 
intermediate  orbit  between  a  geosynchronous  satellite  and  a  low  altitude 
communications  satellite. 


Note  that  the  satellite  model  in  (5.30)  is  assumed  to  be  controlled 
only  by  the  tangential  thrust  m0.  In  order  to  apply  the  approximate 

linearization  procedure,  the  state  equations  must  be  normalized  according 
to  the  characteristic  scales  of  each  state.  The  following  characteristic  scales 
have  been  assumed  for  the  states  and  the  input: 

xlc  =  12.74  Mm.  =  (2  earth  radii) 

x2c  =1.58  Mm/hr. 

Jt3c  =  nil  radians 

x4c  =  nil  rad/hr. 


Ufa  =  10,000  kg. Mm/hr. 


The  eigenvalues  of  the  open- loop  system  are  at  2  =  0  and 
X3  4  =  ±1.58/,  i.e.  the  system  is  critically  stable.  A  linear  quadratic 
optimal  feedback  design  with  unit  weight  on  all  the  states  and  the  input 
places  the  closed  loop  poles  at: 

Xl  2  =  -2.9882  ±  3.3655/ 

X3A  =  -0.2429  ±  0.1829;. 

We  note  that  the  feedback  is  calculated  in  the  scaled  coordinates. 
Therefore  unit  weighting  of  the  states  and  the  input  in  the  cost  function  of 
the  quadratic  optimal  feedback  design  makes  sense,  unless  dictated  by  other 
design  criteria.  The  feedback  gains  that  place  the  poles  of  the  scaled  system 
in  the  above  locations  were  calculated  as: 

K  =  [-23.6995  -1.0502  1.0000  -13.0067] 

For  quadratic  linearization,  the  following  coordinate  change  and 
feedback  was  found  (The  computer  program  for  approximate  linearization 
presents  the  coordinate  change  and  feedback  in  the  unsealed  coordinates  so 
that  an  implementation  of  the  feedback  in  the  natural  coordinates  can  be 


readily  done): 

z,  =  *,  -0.098 137x,2  (5.31a) 

z2~x2~  0.19625x1jc2  (5.31b) 

z3  =  jc3-  0.0043866x!jc2+  0.0078554x2*4  (5.31c) 

z4  =x4  -0.032852*,2-  Q.\\116xxxA-  0.004875*22  +  0.31637x42  (5.31d) 

a<2)(x)  =  -1767.1x,x2  -  10499*^  (5.32) 


(5.33) 


|3(1)(*)  =  -0.19623*!  +  0.63274*4 

The  second  degree  terms  in  the  exactly  linearizable  system  are: 


y(2)  _ 


0 


-0.5878*j2  +3.1597*!*4+  12.736*42 
0.0014598*22 

L  0.01  947*j*2  -  0.15701*2*4  J 


8(l\x)  = 


0 

0 

-6.1665e-07*2 

-6.16e-06*j 


The  system  was  simulated  first  with  various  nonzero  initial 
conditions  and  no  forcing  input.  Then  a  sinusoidal  reference  input  was 
applied.  In  all  of  the  following  simulations,  the  curves  are  represented  as: 


LMLF  ( - ):  Reference  Linear  Model  with  Linear  Feedback. 

NSLF  ( - ):  Nonlinear  System  with  the  Linear  Feedback  design. 

NSQF  (-- . ):  Nonlinear  System  with  Quadratic  Feedback  design. 


In  the  first  set  of  simulations,  a  displacement  of  +2  Mm  along  the  radius 
from  the  nominal  orbit  was  used  as  an  initial  condition,  i.e.  *  =  [2  0  0  0]’ 
(initial  condition  #1).  This  value  corresponds  to  a  deviation  of  2,000 
kilometers  from  the  nominal  orbit.  When  a  satellite  is  first  put  up  into 
orbit,  it  is  likely  to  encounter  deviations  of  up  to  20%  from  its  nominal 
orbital  radius.  Therefore  the  initial  condition  chosen  is  a  realistic  value. 


The  response  in  Fig.  5.65  is  for  the  radial  displacement  of  the 
satellite  from  the  nominal  orbit.  The  NSQF  follows  the  LMLF  closely 
along  the  trajectory.  In  the  radial  velocity  response,  the  NSQF  is  very 
close  to  the  LMLF  whereas  the  NSLF  displays  an  overshoot  first,  and  it 
does  not  get  close  to  the  linear  system  for  a  longer  period  of  time.  In  Fig. 
5.67  of  the  reference  angle,  the  NSLF  exhibits  a  large  deviation  from  the 
linear  ideal  model  LMLF.  The  NSQF  tracks  the  linear  ideal  model 
extremely  closely  in  this  case.  It  appears  from  the  large  deviation 
exhibited  by  NSLF  from  the  linear  ideal  model  LMLF  in  Fig.  5.67  that  the 
nonlinearities  that  affect  the  dynamics  of  the  satellite  along  the  tangential 
direction  of  the  motion,  i.e.  those  which  influence  the  angular  deviation, 
have  a  stronger  effect  on  the  motion.  The  nonlinear  control  is  very 
effective  in  reducing  these  undesired  nonlinear  effects.  This  result  is  also 
due  to  the  fact  that  the  satellite  is  being  controlled  by  the  tangential  thrust 
alone 


The  behavior  of  the  NSQF  in  the  plot  of  the  angular  velocity.  Fig. 
5.68,  is  very  close  to  the  LMLF.  The  large  initial  overshoot  of  the  NSLF 
in  the  angular  velocity  is  obviously  the  cause  of  the  deviation  in  the  angular 
displacement,  since  the  displacement  is  the  integral  of  the  velocity. 

The  plot  of  the  input  efforts  in  Fig.  5.69  show  that  the  control  for 
the  NSQF  is  closer  to  the  LMLF.  In  the  input  plots,  the  initial  value  of  the 
thrust  was  vey  large  for  all  three  systems  (about  40,000).  Since  there  was 
not  an  appreciable  difference  between  these  curves  for  the  first  1/4  hrs.  of 
the  simulation,  part  of  the  plot  was  removed  to  readjust  the  scale  so  that 
the  difference  between  the  inputs  could  be  distinguished. 


LMLF  ( 
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Fig.  5.69.  Magnitudes  of  the  inputs  for  Near  Circular  Satellite  with  nonzero  initial  condition  #1. 


In  the  Figures  5.70  to  5.74  the  responses  to  an  initial  displacement  of 
-2  Mm  from  the  nominal  orbit  along  the  radius  are  shown  (initial 
condition  #2).  This  value  is  the  negative  of  the  initial  condition  #1,  and  it 
is  intended  as  a  test  to  check  whether  the  response  will  display  sensitivity  to 
initial  conditions,  which  was  found  to  be  absent  in  the  NSQF  of  Example  3. 

Comparing  Fig.  5.65  for  the  initial  condition  #1  and  Fig.  5.70  for 
the  response  of  the  radial  deviation  from  the  nominal  orbit,  we  observe 
that  the  tracking  by  the  NSQF  of  the  LMLF  is  quite  satisfactory.  The 
NSQF  displays  a  response  that  is  only  slightly  sensitive  to  initial  conditions. 
We  note  that  when  evaluating  initial  condition  sensitivity  for  the  nonlinear 
system,  we  are  actually  looking  for  the  type  of  behavior  typical  of  linear 
systems:  Are  the  response  curves  symmetric  with  respect  to  the  time  axis 


when  the  sign  of  the  initial  conditions  is  changed?  The  two  curves  of 
NSFQ  in  Figs.  5.65  and  5.70  are  not  exactly  symmetric,  with  a  small 
overshoot  displayed  by  the  NSQF  in  Fig.  5.70  that  is  absent  in  the  plot  of 
Fig.  5.65.  However,  the  difference  is  minor,  and  it  does  not  affect  the 
tracking  of  the  LMLF. 

Comparison  of  x2  in  Figs.  5.71  and  5.66,  of  x3  in  Figs.  5.72  and 
5.67,  and  of  *4  in  Figs.  5.73  and  5.68  between  the  two  sets  of  initial 
condition  responses  #1  and  #2  all  show  that  the  response  of  the  NSQF  is 
slightly  initial  condition  dependent,  but  not  as  strongly  as  the  curves  of  the 
NSLF.  Immediately  after  the  transient  part  of  the  response,  the  NSQF 
approaches  extremely  close  to  LMLF. 

The  plot  for  the  magnitude  of  the  inputs  in  fig.  5.74  has  been 
rescaled  to  distinguish  between  the  different  inputs.  Similar  to  the  input 
plot  of  initial  condition  #1,  in  the  section  of  the  plot  that  is  not  shown,  all 
the  inputs  were  very  close  to  each  other.  This  rescaling  helps  us  in 
distinguishing  the  following  feature:  For  all  the  improvement  achieved  in 
the  tracking  of  the  linear  ideal  model  by  the  NSQF,  it  is  surprising  that  the 
input  effort  needed  for  this  is  actually  smaller  than  those  needed  by  the 
controls  of  LMLF  and  NSLF.  This  is  a  feature  that  was  also  present  in  the 
input  plots  of  Fig.  5.69  of  the  initial  condition  #1. 
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LMLF  ( . ) _ NSLF  ( - ) _ NSQF  ( . ) 

Fig.  5.74.  Magnitudes  of  the  inputs  for  Near  Circular  Satellite  with  nonzero  initial  condition  #2. 


An  angular  displacement  of  1  radian  from  the  nominal  orbit  has  been 
applied  for  the  following  simulations  (x  =  [0  0  1  0]').  The  NSQF  tracks 
the  LMLF  very  closely  in  the  radial  displacement  (Fig.5.75),  radial 
velocity  (Fig.  5.76),  angular  displacement  (Fig.  5.77),  and  angular  velocity 
(Fig.  5.78).  These  response  curves  show  that  the  quadratic  control  is 
surprisingly  effective  in  tracking  the  ideal  linear  model,  especially  when 
compared  with  the  linear  control  of  the  nonlinear  system,  the  NSLF. 
Please  note  that  in  the  simulations  of  both  the  NSLF  and  the  NSQF,  we 
have  integrated  the  truncated  equations  of  motion  presented  in  Eqn.  (5.30). 

The  NSLF  displays  large  transient  errors  and  longer  settling  times  in 
all  the  states.  A  reason  why  the  NSQF  is  so  effective  in  improving  the 
system  behavior  in  responding  like  the  LMLF  in  the  state  of  the  angular 


position  is  because  the  system  is  being  controlled  by  a  tangential  thrust, 
which  directly  affects  the  angular  acceleration. 
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Because  of  the  initially  large  magnitude  of  the  input  thrusts  in  Fig. 
5.79,  the  scale  of  the  plot  was  too  large  to  allow  the  different  inputs  to  be 
clearly  distinguished,  and  we  truncated  a  small  portion  of  the  plot  by 
rescaling  the  axes.  This  is  why  the  input  curves  are  missing  for  the  first 
1/3  hours  of  the  simulation.  In  this  truncated  part,  the  input  values  for  all 
systems  were  as  high  as  3000,  and  they  were  very  close  to  each  other. 
Since  we  are  running  fictitious  simulations,  we  can  afford  such  large  input 
forces.  In  reality,  however,  such  thrust  values  may  not  be  available  for 
being  able  to  recover  from  the  large  initial  deviations  we  have  imposed  on 
the  system,  and  longer  settling  times  would  be  physically  more  realistic. 


LMLF  ( . ) _ NSLF  ( - ) _ NSQF  ( -  . )| 

Fig.  5.75.  Free  response  of  state  Jtj  of  Near  Circular  Satellite  with  nonzero  initial  condition  #3. 
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Figures  5.80  through  5.84  show  simulation  results  for  an  angular 
displacement  of  -1  radian  from  the  nominal  orbit,  an  initial  condition  with 
a  sign  opposite  of  the  previous  case.  In  the  captions  of  these  plots,  this  case 
is  referred  to  as  initial  condition  #4.  The  discussion  for  the  responses  in 
this  case  is  nearly  identical  to  that  of  the  initial  condition  #3.  The  NSQF  is 
extremely  succesful  in  the  tracking  of  the  LMLF  in  all  the  states  in  the 
Figs.  5.80,  5.81,  5.82  and  5.83.  The  sensitivity  properties  to  a  change  in 
initial  conditions  is  excellent,  and  in  this  sense  the  NSQF  behaves  almost 
exactly  like  a  linear  system. 

As  in  the  earlier  simulation,  the  input  effort  is  initially  very  large, 
and  the  difference  between  various  controls  could  be  clearly  seen  only 
when  a  small  initial  portion  of  the  graph  was  truncated. 


LMLF  (  ) _ NSLF  ( - ) _ NSQF  ( . ) 

Fig.  5.80.  Free  response  of  state  Xj  of  Near  Circular  Satellite  with  nonzero  initial  condition  #4. 


Finally,  for  the  last  simulation,  a  sinusoidal  forcing  of  30,000sin(6t) 
has  been  applied  to  the  satellite.  Note  that  this  reference  signal  has  the 
same  units  as  the  input.  The  amplitude  of  the  sinusoidal  forcing  magnitude 
may  seem  very  large,  but  we  note  that  as  seen  in  Eqn.  (5.28),  the  thrust  is 
divided  by  the  mass  and  the  orbital  radius  to  yield  the  angular  acceleration. 
This  value  has  teen  chosen  large  so  as  to  be  able  to  effectively  distinguish 
between  the  behavior  of  the  NSLF  and  the  NSQF.  Such  a  large  reference 
forcing  is  not  physically  realizable,  and  this  portion  of  the  example  is 
simulated  only  for  the  sake  of  demonstrating  the  effectiveness  of  our 
method.  The  period  of  the  disturbance  was  chosen  to  be  close  to  the  orbital 
period  of  the  satellite.  This  type  of  periodic  disturbances  are  typical  for  a 
satellite,  and  they  might  represent  external  effects  such  as  solar  pressure  or 
gravitational  forces  due  to  other  celestial  objects. 

In  the  radial  displacement  response  of  Fig.  5.85  we  observe  the  same 
phenomenon  seen  in  the  simulations  of  Examples  1,  2,  and  3,  i.e.  the 
quadratic  feedback  causes  the  average  value  of  the  sinusoidal  response  to 
approach  zero.  Similar  behavior  is  displayed  with  a  larger  deviation  in  the 
angular  displacement  response  of  Fig.  5.87.  The  average  drift  in  all  the 
models  away  from  the  equilibrium  in  this  fugure  was  very  interesting,  and 
the  simulation  was  extended  until  f  =  20  to  observe  the  steady  state 
behavior,  which  is  shown  in  Fig.  5.88.  The  NSLF  displays  a  large  drift 
from  the  reference  angle.  The  LMLF  shows  a  sinusoidal  response  with 
zero  average.  The  NSQF  is  much  closer  to  the  LMLF  in  average  than  the 
NSLF.  However  it  is  obvious  From  Fig.  5.88  that  the  NSQF  will  also 
exhibit  a  steady  state  average  error.  This  is  because  the  nonlinear  system  is 
not  linearizable,  and  the  error  is  due  to  the  residual  second  degree  terms 
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that  could  not  be  removed  by  coordinate  change  and  feedback.  Note  that 
the  magnitudes  of  the  nonlinearities  are  significant  for  these  values  of  the 
states. 

Figs.  5.86  and  5.89  show  the  radial  and  angular  velocities, 
respectively.  In  these  curves,  the  linear  and  nonlinear  control  responses 
are  not  readily  distinguishable.  The  difference  becomes  noticeable  when 
the  velocities  are  integrated  to  obtain  the  displacements,  as  discussed  above. 
In  the  the  angular  velocity  response  x4  of  the  satellite,  we  observe  a  large 

overshoot  in  the  NSQF.  In  trying  to  correct  the  nonlinearities,  the 
quadratic  controller  introduces  very  large  deviations. 

Fig.  5.90  shows  that  the  input  thrust  for  the  NSQF  also  assumes  very 
large  values  during  the  transient  part  of  the  response.  This  may  be  due  to  a 
tendency  of  the  quadratic  control  to  drive  the  system  unstable  for  large 
deviations  from  the  equilibrium.  However,  whether  this  large  input  force 
is  really  necessary  to  effectively  cancel  the  nonlinear  terms  in  the  system 
(i.e.  are  the  nonlinear  terms  really  so  large  to  need  such  a  strong  force  to 
cancel  them),  or  the  higher  degree  approximation  of  our  approach  starts 
failing  for  these  large  values  of  the  states  is  not  clear. 

Large  values  for  the  input  force  was  also  observed  for  Example  1 
presented  earlier  in  this  chapter.  In  order  to  improve  on  the  performance 
of  the  nonlinear  controller  for  some  systems  that  display  similar  tendencies 
of  instability,  certain  modifications  in  the  design  procedure  may  have  to  be 
made.  It  is  not  entirely  clear  at  this  moment  what  these  modifications 
should  be,  and  this  is  certainly  one  of  the  open  research  questions  that 
extends  from  this  work. 


_ NSLF ( - 
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6.  SUMMARY,  CONCLUSIONS  AND  FUTURE  RESEARCH 
DIRECTIONS 


6.1.  Summary 

In  this  report,  we  have  developed  a  method  for  approximate  linearization 
of  nonlinear  control  systems  using  coordinate  transformation  and  feedback 
pairs.  First,  a  series  expansion  of  a  given  nonlinear  system  at  some 
nominal  operating  point  was  obtained.  Based  on  the  Hunt-Su  theorem 
[17,18]  and  the  approximate  linearization  results  in  [29]  we  formulated  the 
problem  of  linearizing  a  nonlinear  control  system  by  nonlinear  coordinate 
transformation  and  nonlinear  feedback.  The  form  of  the  coordinate  change 
and  feedback  were  chosen  such  that  in  the  vicinity  of  the  nominal  operating 
point  the  transformation  approaches  the  identity  map.  The  differential 
equations  of  the  system  were  then  evaluated  in  the  new  set  of  coordinates 
using  the  given  coordinate  change  and  feedback. 

With  the  goal  of  eliminating  the  nonlinear  terms  of  a  given  degree  in 
the  series  expansion,  we  obtained  the  Generalized  Homological  Equations. 
These  equations  were  evaluated  after  choosing  a  suitable  basis  in  which  the 
pth  degree  vector  valued  monomials  were  expressed.  When  the 
coefficients  of  the  monomials  of  similar  powers  were  set  equal  to  each 
other,  we  obtained  a  linear  system  of  equations  in  the  unknown  coefficients 
of  the  coordinate  change  and  feedback,  i.e.  in  the  coefficients  of  <j>(2)(jc), 
a(2)(x)  and  P(1)(jc).  Since  these  terms  are  coefficients  of  monomials  of 
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degree  p,  the  resulting  system  of  linear  equations  is  approximately  of  order 
n  p  +  1  where  n  is  the  dimension  of  the  nonlinear  vector  field. 

The  calculation  procedures  to  solve  this  set  of  equations  were  then 
implemented  in  the  MATLAB  computer  program.  MATLAB  is  an 
application  language  with  a  rich  collection  of  matrix  based  computational 
algorithms.  There  are  toolbox  packages  available  for  control  systems, 
system  identification,  and  signal  processing.  It  is  also  a  programming 
environment  with  a  convenient  user  interface.  By  using  the  built-in 
subroutines  available  in  the  package  and  writing  additional  subroutines, 
functions  and  procedures  one  can  create  programs  that  automate  the 
solutions  of  mathematical  problems.  This  package  proved  to  be  a  very 
suitable  tool  for  writing  the  computer  program  that  implements  the  results 
of  this  report. 

The  computer  program  prompts  the  user  for  the  input  parameters  of 
the  system  to  be  linearized  such  as  order  of  the  system  and  number  of 
inputs,  characteristic  scales  for  each  state  and  input,  linear  part  of  the  plant 
and  the  input  coefficient,  and  nonlinear  terms  of  second  degree  in  the  states 
and  the  inputs.  After  obtaining  the  parameters  of  the  system,  the  program 
first  calculates  the  linear  and  nonlinear  parts  of  the  system  in  the  scaled 
coordinates.  A  linear  set  of  equations  in  the  unknown  coefficients  of  the 
coordinate  change  and  feedback  is  then  obtained.  This  large  system  of 
equations  is  solved  by  a  singular  value  decomposition  subroutine.  The 
solution  obtained  is  presented  to  the  user  as  the  result  of  the  first  part  of 
the  program.  At  this  point,  having  found  the  coordinate  change  and 
feedback  needed,  one  can  terminate  the  program,  or  continue  to  design  a 
feedback  law  for  the  linear  part  of  the  system. 
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In  the  program,  the  nonlinear  feedback  needed  to  achieve  the  second 
degree  linearization  is  calculated  in  the  scaled  natural  coordinates.  Prior  to 
running  simulations,  symbolic  expressions  for  the  differential  equations  are 
generated  to  speed  up  the  integrations.  Three  different  models  are 
simulated  using  the  above  feedback:  1)  An  exactly  linear  model  (which 
agrees  with  the  linear  part  of  the  system)  to  serve  as  a  benchmark,  2)  The 
nonlinear  system  with  a  linear  feedback  design  based  on  a  first  degree 
approximation,  3)  The  nonlinear  system  that  is  linearized  up  to  second 
degree  with  a  quadratic  feedback  and  with  the  additional  linear  feedback  as 
in  1)  and  2). 

Various  forms  of  initial  conditions  and  reference  input  signals  (such 
as  impulses,  steps  and  sinusoidal  inputs)  can  be  applied  to  all  three  systems. 
They  are  integrated  for  a  specified  length  of  time.  At  the  end  of  the 
simulations,  the  resulting  state  trajectories  are  converted  back  to  natural 
unsealed  coordinates  and  plotted  for  comparison.  The  feedback  inputs  for 
all  three  simulations  may  also  be  plotted.  Phase  portraits  of  the  states  can 
also  be  made.  Once  the  plots  are  completed,  one  can  extend  the  simulations 
for  a  longer  period  of  time,  apply  different  inputs  for  a  different  set  of 
simulations,  design  different  forms  of  feedback  for  the  linear  part  of  the 
system  or  terminate  the  program. 

Using  the  computer  program  as  a  tool,  we  found  nonlinear  feedback 
and  coordinate  transformation  pairs  for  various  example  nonlinear 
systems.  We  tested  the  resulting  closed  loop  systems  with  different  initial 
conditions  and  impulse,  step  or  sinusoidal  forcing  inputs.  The 
improvement  of  the  second  degree  approximation  over  the  first  degree 
design  was  superior  in  almost  all  the  cases  that  were  tested.  The  systems 


that  are  exactly  linearizable  showed  the  best  improvement,  and  when  the 
errors  due  to  the  second  degree  terms  in  the  coordinate  change  became 
small,  the  behavior  of  the  linearized  systems  were  almost  exactly  linear. 
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For  the  type  of  systems  that  are  not  exactly  linearizable,  there  was  a 
noticable  improvement,  especiallly  in  the  steady  state  responses  to  a 
disturbance.  The  most  impressive  results  were  obtained  when  the  systems 
were  forced  by  a  sinusoidal  forcing  function.  Unlike  linear  systems,  the 
response  of  nonlinear  systems  to  a  sinusoidal  forcing  do  not  usually  have 
zero  mean.  Depending  on  the  nonlinear  termsthat  are  present,  there  may 
be  a  nonzero  bias  value  which  is  approximately  the  average  of  the 
nonlinearities  over  a  period  of  the  forcing  function.  In  this  case,  it  was 
observed  from  the  simulations  that  the  quadratic  feedback  decreased  this 
bias  term,  and  for  exactly  linearizable  systems  the  bias  was  exactly  zero  in 
the  steady  state.  The  type  of  disturbances  a  control  system  encounters  in 
the  real  world  are  not  impulse  or  step  inputs  but  noisy  signals  in  general. 
The  linearized  systems  have  not  been  tested  with  noisy  inputs.  However, 
the  response  behavior  for  the  sinusoidal  inputs  may  be  an  indication  that 
the  improvements  will  be  satisfactory. 


6.2.  Conclusions  and  Future  Research  Directions 

One  of  the  main  contributions  of  this  report  is  obtaining  a  numerical 
solution  to  the  approximate  linearization  problem  for  nonlinear  control 
systems.  The  exact  solvability  of  the  homological  equations  and  the 
linearizability  of  a  system  according  to  the  Hunt  Su  theorem  (up.  to  the 
specified  degree)  are  equivalent  conditions.  Therefore  whenever  a  control 
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system  is  not  exactly  linearizable,  it  is  impossible  to  find  an  exact  solution 
to  this  system  of  linear  equations.  When  a  solution  exists,  the  coordinate 
change  and  feedback  that  are  found  transform  the  nonlinear  system  into  an 
exactly  linear  system  (up  to  the  specified  degree)  in  a  new  set  of 
coordinates. 

When  there  is  no  exact  solution,  we  have  solved  a  least  squares 
problem  by  finding  an  approximate  solution  to  the  system  of  equations 
using  the  singular  value  decomposition.  Before  finding  an  approximate 
solution,  the  state  variables  were  normalized  according  to  their 
characteristic  scales.  By  defining  the  least  squares  problem  in  a  consistent 
way,  we  minimized  the  norm  of  the  error  between  an  approximate  solution 
and  a  “nearby”  exact  solution.  As  shown  in  Section  3,  this  is  equivalent  to 
solving  the  associated  singular  value  decomposition  problem  for  the  linear 
system  of  equations  in  the  scaled  coordinates. 

The  Hunt-Su  theorem  is  severe  in  its  restriction  that  a  transformation 
and  feedback  that  linearizes  a  nonlinear  system  has  to  achieve  the 
transformation  exactly,  i.e.  up  to  an  infinite  degree  in  the  series  expansion 
of  the  nonlinear  terms.  When  it  exists,  one  can  calculate  the  coordinate 
change-feedback  pair  using  the  Hunt-Su  theorem.  If  the  Hunt-Su  test  fails, 
there  is  nothing  more  one  can  do  to  linearize  the  system  within  that 
framework.  One  of  the  contributions  of  this  report  is  to  offer  the  above 
described  approximate  solution  to  the  linearization  problem  in  the  case 
when  the  Hunt-Su  theorem  fails. 

The  improvement  achieved  by  our  method  depends  on  whether  or 
not  before  the  linearization  a  given  nonlinear  system  was  far  away  from 


the  “minimum”  solution  yielded  by  the  singular  value  decomposition 
method.  Obviously,  another  important  factor  in  the  performance  of  the 
linearized  system  is  the  relative  magnitude  of  the  nonlinear  terms  in  the 
system  compared  to  the  linear  part.  If  the  nonlinearities  are  very  small, 
the  linear  part  of  the  system  will  dominate  the  behavior,  and  the 
improvements  in  such  a  case  may  not  be  significant. 

The  numerical  calculations  needed  for  the  transformation  and 
feedback  for  a  second  degree  linearization  are  computationally  expensive. 
Further  improvements  in  the  calculation  speeds  of  the  transformation  and 
feedback  may  be  possible  (perhaps  with  the  goal  of  a  real-time 
implementation)  by  removing  some  of  the  redundancies  in  the  linear 
system  of  equations  that  are  obtained  from  the  homological  equations. 

A  next  step  in  this  research  may  be  the  formulation  of  the  third 
degree  linearization  problem.  It  is  expected  that  a  third  degree 
linearization  will  improve  on  the  linear  behavior  of  a  nonlinear  system  in  a 
larger  neighborhood  of  the  operating  point.  However,  the  numerical 
calculations  necessary  in  this  case  are  even  more  burdensome.  The 
question  that  arises  is  whether  a  third  degree  linearization  for  a  single 
operating  point  contributes  a  justifiable  improvement  for  a  broader  range 
in  the  state  space,  or  one  should  perform  successive  second  degree 
linearizations  at  more  than  one  nominal  point.  The  answer  to  this  question 
is  not  clear  and  may  depend  on  the  severity  of  the  nonlinearities,  or  on  the 
individual  application. 

Since  we  have  not  solved  the  third  degree  linearization  problem  in 
this  report,  a  performance  comparison  with  the  second  degree  linearization 
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is  not  possible  at  this  time.  Provided  a  given  system  is  exactly  linearizable, 
a  third  degree  linearization  may  be  a  worthwhile  pursuit.  However,  if  the 
system  is  not  exactly  linearizable,  since  there  are  some  second  degree  terms 
that  can  not  be  removed,  the  benefits  of  a  third  degree  linearization  in  the 
presence  of  these  second  degree  terms  may  not  be  crucial.  This  argument 
also  depends  on  the  relative  magnitudes  of  the  third  degree  terms. 

It  should  be  pointed  out  that  the  formulation  of  the  third  and  even 
higher  degree  linearizations  are  similar  to  the  second  degree  linearization, 
as  shown  in  Chapter  3.  Aside  from  some  additional  technical  difficulty  of 
dealing  with  a  higher  dimensional  system  of  equations  (the  sizes  of  which 
are  directly  related  to  the  presence  of  third  -or  higher-  degree  monomials) 
the  numerical  solution  is  also  the  same. 

During  the  testing  of  the  computer  program  with  various  examples, 
it  was  observed  that  the  stability  characteristics  of  the  example  systems 
were  sometimes  adversely  affected.  The  nonlinear  terms  that  are  present 
in  a  system  may  occasionally  augment  system  stability  in  certain  regions  in 
the  state  space,  which  may  depend  on  their  magnitudes  and  signs  at  any 
given  point  in  the  trajectory.  Attempting  to  remove  these  nonlinear  terms 
may  sometimes  cause  a  system  to  have  a  smaller  basin  of  stability  after 
linearization. 

Our  central  focus  in  this  report  has  been  the  improvement  on  the 
response  of  a  nonlinear  system  in  favor  of  a  linear  one,  and  the  stability 
issue  has  been  considered  only  locally,  i.e.  for  the  linear  part  of  the  system. 
Clearly,  in  a  reasonably  close  neighborhood  of  the  origin,  the  stability 
properties  will  be  dominated  by  the  linear  part  of  the  vector  field.  Since 


linear  systems  can  be  made  globally  asymptotically  stable,  it  was  initially 
expected  that  a  system  that  has  been  approximately  linearized  would  always 
have  a  larger  basin  of  stability  compared  to  the  same  system  before  the 
approximate  linearization.  This  intuitive  generalization  is  not  correct  in 
many  cases. 

The  issue  of  nonlinear  stability  is  a  difficult  problem  that  has  not 
been  completely  solved.  The  stability  of  a  system  with  nonlinear  feedback 
of  the  type  we  are  proposing  may  depend  on  important  properties  of  the 
overall  system  such  as  nonlinear  controllability,  loss  of  convergence  in  the 
series  expansion  of  the  vector  field,  a  loss  of  rank  in  the  inversion  of  the 
coordinate  change  and  feedback,  etc.  These  issues  are  beyond  the  scope  of 
this  report,  but  the  investigation  of  stability  properties  of  the  method  is  a 
natural  extension  of  the  work. 

The  results  presented  in  this  report  and  their  implementation  in  a 
computer  program  that  yields  the  solution  to  approximate  linearization 
problems,  help  us  to  analyze  various  control  systems  and  to  gain  further 
insight  into  the  nonlinear  behavior  of  control  systems.  Valuable  experience 
was  obtained  from  the  various  numerical  experiments  that  have  been 
performed.  Future  research  directions  will  certainly  be  influenced  by 
these  observations  as  well. 
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APPENDIX  A: 

AN  EXAMPLE  SESSION  FOR  THE  COMPUTER  PROGRAM 

The  following  example  session  has  been  recorded  dunng  a  sample  run  of 
the  Approximate  Linearization  program.  The  MATLAB  command  “diary 
on”  starts  recording  the  screen  output.  The  command  “diary  off’  stops  the 
recording  of  the  session.  Note  that  the  graphical  output  and  anything  typed 
by  the  user  can  not  be  recorded.  The  user  responses  have  been  added  later 
and  they  are  emphasized  as  boldface  type. 


mainmenu 


SECOND  DEGREE  APPROXIMATION 
TO  NONLINEAR  CONTROL  SYSTEMS 

Controller  Version 

by 

Sinan  Karahan 


Welcome  to  the  Second  Degree  Linearization  Program 
Press  a  key  to  start 

- MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 


Select  a  menu  number:  1 

% 

%This  help  routine  presents  a  *very*  brief  introduct ion--to 
%the  second  degree  approximation  program.  For  more  detailed 
%information  about  the  program  and  the  background  theory 
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%please  consult  to  the  accompanying  "Second  Degree 
% Approximation  to  Nonlinear  Control  Systems  using  MATLAB: 
%Users'  Manual", 
echo  off 
Press  any  key 
% 

%  The  program  solves  for  second  order  approximations  of 
%  nonlinear  control  systems .  Your  control  system  should  be 
%  in  the  following  format: 

% 

%  dx  (2)  (1)  3 

%  —  =  Fx  +  Gu  +  f  (x)  +  g  (x)u  +  0(x,u) 

%  dt 

% 

%  where  F  is  the  n*n  plant  matrix,  G  is  the  n*l  input  vector, 
%  (2)  (1) 

%  f  is  the  second  degree  part  of  the  vector  field,  g 
%  is  the  first  degree  part  of  the  input  coefficients.  All  of 
%  these  terms  have  to  be  calculated  ;  .  the  user  from  the 
%  series  expansionof  the  control  system  at  the  nominal 
%  operating  point  0.  For  further  details  on  the  individual 
%  subroutines  and  function  programs  called  by  the  routines, 

%  use  the  help  utility  in  MATLAB. 
echo  off; 

Press  any  key 

- MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 


Select  a  menu  number:  2 
-  Input  one  of  the  following:  - 

1)  Enter  data  manually 

2)  Enter  a  filename  to  retrieve  data 


Select  a  menu  number: 

-  For  entry  of  data,  choose 

1)  Novice  mode 

2)  Expert  mode 


Select  a  menu  number: 

enter  order  of  the  system,  n:  3 

enter  dimension  of  the  control,  m:  1 
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Enter  a  characteristic  scale  for  each  state  (used  for 
normalizing) 

scale  of  xl  =  1 
scale  of  x2  =  1 
scale  of  x3  =  1 

Enter  a  characteristic  scale  for  each  input 
scale  of  ul  =  1 

Enter  3x3  plant  matrix,  F:  [0  1  0;0  0  l;-3  2  -1] 

0  10 

0  0  1 

-3  2-1 

Enter  3x1  control  matrix,  G:  [  0  ;  0  ;  1  ] 

0 

0 

1 

Enter  the  coefficients  of  the  second  degree  terms  f2  in  the 
series  expansion  of  the  control  system  in  the  following 
order : 

xlxl  xlx2  xlx3  x2x2  x2x3  x3x3 
Note  the  dimensions  of  f2 : 

3  rows  [for  each  state  equation  from  1  to  3]  by  6  columns 
[for  the 

coefficients  of  above  terms]. 

f2  =  [1  0  -1  0  0  0;0  -1  0  1  0  1;0  0  0  0  1  0] 

10-1000 
0-10101 
0  0  0  0  1  0 

Enter  coefficients  of  the  first  degree  terms  in  gl  in  the 
form  of  a  3  x  3  matrix,  where 
[1] 

g  (x) *u  =  gll*x*ul 

each  gli  is  a  3  by  3  matrix,  and  gl  is  formed  of  row  blocks 
of  gli. 

gl  =  [1  0  0 ;  0  0  0;0  1  0] 

10  0 

0  0  0 

0  10 


Enter  a  file  name  to  store  this  data:  sessiondat 
Note:  The  variables  are  being  saved  in  the  following  order: 
n  m  xscale  uscale  f  g  f2  gl 


Enter  one  of  the  following 
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1)  Display  the  variables  n  m  xscale  uscale  f  g  f2  gl 

2)  Return  to  main  menu 


Select  a  menu  number:  1 
Your  system  is  as  follows: 


Dimension  of  the  system: 
n 


3 

Dimension  of  the  control 
m 


1 

Scale  factors  of  the  states,  xl  through  xn  : 
111 

Scale  factor(s)  of  the  input (s^,  ul  through  urn  : 
1 

Press  any  key 

Linear  part  of  the  plant: 

F 


0  10 

0  0  1 

-3  2  -1 

Constant  part  of  the  input  vector: 

G 

0 

0 

1 

Press  any  key 

Second  degree  part  of  the  plant: 

(2) 

f  (x) 

f 2  ( 1 )  =  +x(l)  *x(l)-x(l)  *x(3) 
f  2  (2)  =  -x(l)*x(2)+x(2)*x(2)+x(3)*x(3) 
f  2  (3)  =  +x  (2)  *x  (3) 

Press  any  key 

First  degree  coefficient  of  the  input (s): 
(Each  gli  multiplies  i'th  input  ui) 


Corrections  should  be  done  by  choosing  #2  from  the  main  menu. 
Press  any  key 


MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 


Select  a  menu  number:  3 
•  Calculating  the  scaled  variables... 


We  seek  a  quadratic  change  of  coordinates 
(2) 

z  =  x  -  phi  (x) 
and  feedback 

(2)  (1) 
v  =  alpha  (x)  +  (I  +  beta  (x) ) .u 
which  transforms  our  system  into  the  form 

dz  2  3 

- -  Fz  +  Gv  +  R(z,v)  +  0(z,v) 

dt 

where,  if  the  system  is  exactly  linearizable,  R(z,v) 
(residual)  is  zero. 


We  use  the  homological  equations 


(2)  (2)  (2) 


[  F.x, 

phi 

(x) 

]  +  alpha 

(x) 

=  f  (x) 

(2) 

(1) 

(1) 

[  G.u, 

phi 

(x) 

]  +  G.beta 

(x) 

=  g  (x) .u 

•  Calculating  the  system  of  equations... 

.  2 

•  We  construct  a  large  linear  system  LX  =  B  in  the  0(x,u) 


•  coefficients  of  the  homological  equations.  This  system  has 

• 

•  2  2 

•  ROWS:  n  (n+l)/2  +  mn 

• 

•  2  2 

•  COLUMNS:  n  (n-fl)/2  +  mn(n+l)/2  +  m  n 

• 

•  In  general,  the  column  rank  is  deficient  and  the  solution 

•  is  overdetermined. 

•  We  use  the  SVD  algorithm  to  get  the  "nearest"  possible 

•  solution. 

•  Solving  for  the  coordinate  change  and  feedback... 

•  SVD  may  take  a  while,  please  wait. 

•  Calculations  done. 

Press  any  key 

-  Please  choose  a  menu  item  - 

1)  Display  coordinate  change  and  feedback 

2)  Show  the  closest  linearizable  system 

3)  Exit  to  main  menu 


Select  a  menu  number:  1 
press  any  key 

Phi2  in  the  second  degree  coordinate  change  z=x-phi2 (x) : 

phi2  ( 1)  =  +0.196  97  *x(l)  *x(l)  +x  (1)  *x(3)-0.5*x(2)  *x(2) 
phi 2  (2)  =  -4*x(l)  *x { 1) +2 . 3  939*x  ( 1)  *x(2) 

phi2 (3)  =  -7*x(l) *x(2)+2.3939*x(l) *x(3)+1.3939*x(2) *x(2)- 
x<3) *x  (3) 

Press  any  key 

Second  degree  part  of  feedback:  Alpha2 

alpha2  ( 1 )  =  -1 . 4091*x (1) *x (1) +7*x (1) *x  (2) - 

2*x(l) *x(3)+7.1061*x(2) *x (2) -0 . 18182*x (2) *x (3) -x (3) *x  (3) 

Press  any  key 

First  degree  part  in  the  feedback:  Betal 
betal  (1,  1)  =  -2 ,3939*x(l) +x(2) +2*x(3) 
press  any  key 

-  Please  choose  a  menu  item  - 


1)  Display  coordinate  change  and  feedback 

2)  Show  the  closest  linearizable  system 

3)  Exit  to  main  menu 


Select  a  menu  number:  2 

The  system  is  exactly  linearizable  up  to  degree  2. 


Please  choose  a  menu  item 


1)  Display  coordinate  change  and  feedback 

2)  Show  the  closest  linearizable  system 

3)  Exit  to  main  menu 


Select  a  menu  number:  3 
- MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 


Select  a  menu  number:  4 

Closed  loop  feedback  design  for  the  linear  part  of  the  plant 
-  Input  one  of  the  following:  - 

1)  Specify  closed  loop  eigenvalues 

2)  Design  Linear  Quadratic  Optimal  feedback 


Select  a  menu  number:  2 

Quadratic  regulator  will  minimize:  integral (x 'Qx  +  u'Ru)dt 
Enter  3  by  3  <positive  semi-def inite>  matrix  Q 

Q  =  [1  0  0;  0  1  0;  0  0  1] 

Enter  1  by  1  <positive  definite>  matrix  r 

R  =  1 

The  gains  found  are: 
kfeed  = 

0.1623  6.9158  2.9789 

The  resulting  closed  loop  eigenvalues  are: 
cleig  = 

-0.7245  +  0 . 8515i 
-0.7245  -  0 . 8515i 
-2.5299 
Press  a  key 

- MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 
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Select  a  menu  number:  5 

•The  following  three  systems  will  be  simulated  in  scaled 
coordinates : 

[1]  dx/dt  =  (F+G.K) .x  +  r 

• 

*  (2)  (1)  (1) 

[2]  dx/dt  =  (F+G.K) x  +  f  (x)  +  g  (x)Kx  +  (G  +  g  (x))r 

• 

*  (2)  (1) 

[3]  dx/dt  =  Fx  +  f  (x)  +  (G  +  g  (x))u(x) 

• 

•  where  r  is  a  reference  input  signal. 

•  The  nonlinear  feedback  u(x)  in  [3]  is  : 

•  -1 

•  u(x)  =  (I  +  betal (x) )  {K(x  -  phi2 (x) )  -  alpha2 (x)  +  r} . 

• 

•  [2]  is  the  nonlinear  system  with  a  linear  feedback  design. 

•  [3]  is  the  same  system  with  linear  +  quadratic  feedback. 

•  [1]  is  a  linear  reference  model.  It  is  expected  that  as 

•  x  approaches  zero,  [3]  will  agree  with  [1]  closer  than  [2] . 

Calculating  symbolic  expressions  for  [1],  [2]  and  [3]... 

Press  any  key 

-  Simulation  Menu  - 

1)  Free  response-nonzero  initial  conditions 

2)  Impulse  input-zero  or  nonzero  initial  conditions 

3)  Step  input-zero  or  nonzero  initial  conditions 

4)  Sinusoid  input-zero  or  nonzero  initial  conditions 

5)  Exit  to  main  menu 

Select  a  menu  number:  1 

Enter  initial  conditions  (column  vector):  xO  = [0 . 1; 0 . 1 ; 0 . 1] 
Simulation  will  start  from  initial  time  tO  =  0. 

Enter  final  time  (*scaled*  time) :  tf  =  5 

•  Simulation  of  the  linear  model  done 

•  Simulation  of  the  nonlinear  system  with  linear  control  done 

•  Simulation  of  the  nonlinear  system  with  quadratic  control 
done 


Plot  Menu  - 

1)  Plot  time  response 

2)  Plot  phase  portrait 

3)  Extend  simulation  for  some  more  time 

4)  Return  to  simulation  menu 


Select  a  menu  number:  1 

Legend  for  plots  (colors  seen  in  color  monitors  only) : 

#1:  Linear  reference  model  ( _  Solid;  red) 

#2:  Nonlinear  system  with  linear  feedback  Dashdot; 

green) 

#3:  Nonlinear  system  with  quadratic  feedback  ( - Dashed; 

blue) 

Enter  simulations  to  see:  "1"  for  #1,  "12"  for  #1&#2,  etc 
Please  enter  the  number (s)  in  increasing  order 

Enter  curve  number (s)  :  123 

Press  a  key  to  see  each  plot 

(The  plots  are  viewed  on  the  graphics  screen  of  MATLAB) 

Do  you  want  to  see  the  control  input  magnitudes? (y/n) :  y 
•  Calculating  the  magnitudes  of  the  control  inputs... 

(The  plot  is  viewed  on  the  graphics  screen  of  MATLAB) 

-  Plot  Menu  - 

1)  Plot  time  response 

2)  Plot  phase  portrait 

3)  Extend  simulation  for  some  more  time 

4)  Return  to  simulation  menu 

Select  a  menu  number:  4 
-  Simulation  Menu  - 

1)  Free  response-nonzero  initial  conditions 

2)  Impulse  input-zero  or  nonzero  initial  conditions 

3)  Step  input-zero  or  nonzero  initial  conditions 

4)  Sinusoid  input-zero  or  nonzero  initial  conditions 

5)  Exit  to  main  menu 


Select  a  menu  number:  5 
- MAIN  MENU - 

1)  Help  on  program  and  functions 

2)  Enter  new  nonlinear  system  parameters 

3)  Solve  the  second  degree  linearization  problem 

4)  Feedback  design  for  the  models 

5)  Simulate  the  system  and  compare  performances 

6)  Exit 

Select  a  menu  number:  6 

YYYYYYYYYYYYYYYYYYYYYYY 
YY  Have  a  good  day  !  YY 
YYYYYYYYYYYYYYYYYYYYYYY 
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Normal  Forms  for  Linear  and  Nonlinear 

Systems* 

Arthur  J.  Krener* 


1  Introduction 

It  is  well-known  that  a  state  space  description  of  a  controllable  linear  system 
can  be  transformed  to  controllable  or  controller  form  by  a  linear  change  of  state 
variables.  A  state  space  description  of  an  observable  linear  system  can  be  trans¬ 
formed  to  observable  or  observer  form  by  a  linear  change  of  state  variables. 
Moreover  the  former  are  closely  related  to  right  matrix  fractional  descriptions 
(RMFD)  of  the  transfer  function  and  the  latter  are  closely  related  to  left  ma¬ 
trix  fractional  descriptions  (LMFD).  These  facts  can  be  found  in  many  texts 
such  as  Wolovich  jl]  or  Kailath  [2).  (The  reader  should  be  warned  that  the 
controllable/controller  and  observable/observer  terminologies  are  not  standard, 
we  follow  that  of  [2]).  Unfortunately  there  is  no  one  treatment  of  this  material 
which  is  suitable  for  our  purposes  so  we  devote  Sections  2  and  3  to  a  review. 
This  is  by  way  of  preparation  for  our  discussion  of  the  existence  and  uniqueness 
of  normal  forms  for  nonlinear  systems  in  Sections  4  and  5.  Our  treatment  gen¬ 
eralizes  Zeitz  [22]  who  discussed  similar  forms  for  scalar  input  and  scalar  output 
nonlinear  systems. 


2  Linear  Normal  Forms 

Throughout  this  paper  we  shall  use  the  following  notation.  The  indices  llt. ..  ,Lq 
are  positive  integers  summing  to  n.  A  prime  triple  (A,B,C)  with  indices  t\, . . . ,  lq 
is  a  triple  of  block  diagonal  matrices  of  dimension  n  x  n,  n  x  m  and  pxnof  the 


‘This  paper  i3  in  final  form  and  no  version  of  it  is  submitted  elsewhere. 

^Research  supported  in  part  by  the  Air  Force  Office  of  Scientific  Research  under  85NM215. 
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d. 


form 


A  =  BlockDiag. 


0  1 


-I  f,  X  f, 


1 

0 


0 


f,xl 


B  =  BlockDiag. 


0 

1 


G  =  BlockDiag.  [  1  0  •••  0  ]'*'• 

The  “prime”  terminology  was  introduced  by  Morse  [3|. 
Consider  the  linear  state  space  description 


=  Ft  +  Gu 

V  =  Ht 


(1) 


(2) 

(3) 

(4) 

(5) 


where  fn*nt  G”xm  an(j  ffr'xn  SyS(;em  said  to  be  controllable  if 


rank  {F”  1 G3  :  j  =  1, . . . ,  m;  r  =  1, . . . ,  n}  =  n  (6) 

(Note:  FT  denotes  the  r,h  power  of  F,  G3  denotes  the  jth  column  of  G  and  H, 
is  the  ith  row  of  H .) 

Every  controllable  linear  system  has  controllability  indices  £i,...,£m  >  0 
characterized  by  £ i  +  . . .  +  tm  —  n  and 

rank{Fr_1GJ  :  j  =  1, . . .  ,m;  r  =  1,  ...,£}  = 

^ank{.Fr~lG,  :  j  =  1, . . . ,  m;  r  =  1, . . . ,  t  A  £y}  (7) 

for  l  ~  The  minimum  of  l  and  tj  is  denoted  by  £,  A  tj.  The  set  of 

controllability  indices  is  uniquely  determined  by  F  and  H  and  does  not  change 
under  linear  state  feedback.  There  can  be  some  freedom,  m  in  the  ordering  of  the 
controllability  indices  even  when  the  ordering  of  the  inputs  remain,  fixed.  This 
is  because  there  may  be  several  orderings  which  satisfy  (7).  Of  course  a  change 
of  variables  in  the  input  space  or  a  reordering  of  the  inputs  can  change  the  order 
of  the  controllability  indices.  We  could  reorder  the  inputs  so  that  £j  <  ...£m 
or  the  reverse  but  we  shall  not  do  so.  To  simplify  notation  we  shall  restrict  our 
attention  to  systems  where  the  controllability  indices  are  positive,  £lt . . . ,  £,„  >  1. 
A  general  system  can  be  made  to  satisfy  this  condition  by  deleting  dependent 
columns  of  G. 
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An  alternative  characterization  of  property  (7)  of  the  controllability  indices 
is  that 

Fl’G3  =  0  (8) 

mod  {Fr~1Gt  :  t  =  1, . . . ,  m;  r  =  1,  ...,(£/+  1)  A  A}- 

The  controllabilities  indices  of  (4),  (5)  are  said  to  be  strict  if  (8)  holds 
mod  {Fr~lG'  :  i  =  r  =  1, . . .  ,t3  A  A}-  The  controllability  indices  are 

strict  iff  there  is  only  one  ordering  of  the  controllability  indices  satisfying  (7). 

It  is  always  possible  to  make  a  linear  change  of  input  coordinates  u  =  f)u 
that  makes  the  controllability  indices  strict  for  the  new  pair  ( F,G )  =  ( F,Gp~l ) 
without  changing  their  order.  One  way  of  accomplishing  this  to  define  1  X  n 
vectors  K3 , . . . ,  Km  by 


KiFT~lG3  = 


{ 


0 


1  <  r  <  l, 
r=i3- 


(9) 


and  let  P  be  the  m  X  m  non-singular  matrix  whose  i  —  j  entry  is 

p\  =  KiFt'~1G3 .  (10) 

In  this  case  ft  satisfies 

Pi  =  S(  A  <  lj.  (11) 

Moreover  p  is  the  only  such  matrix  which  makes  the  controllability  indices  strict 
and  leave  the  order  invariant.  A  change  of  input  coordinates  u  =  A u  preserves, 
the  strictness  of  the  controllability  indices  while  leaving  the  order  invariant  iff 
A}  =■  0  for  A  >  (■}• 

The  system  (4),  (5)  is  said  to  be  observable  if 


rank  {HiFr  1  :  t  =  1, . . .  ,p;  r  =  1, . . . ,  n}  =  n.  (12) 


Every  observable  linear  system  has  observability  indices  >  0  charac¬ 
terized  by  +  ip  =  n  and 

rank  {H,Fr~1  :  i  =  r  =  1,  ...,£}  = 

rank  {HiFr~i  :  i  =  1,. . .  ,p;  r  =  1, . . . ,  l  A  A)  (13) 

for  l  =  The  set  of  indices  is  uniquely  determined  by  H  and  F  and 

does  the  change  under  linear  change  of  coordinates  in  the  state  and  output 
spaces  and  linear  output  injection.  There  can  be  freedom  in  the  ordering  of  the 
observability  indices  even  when  the  order  of  the  outputs  remains  fixed.  This  is 
because  there  may  be  several  orderings  which  satisfy  (13).  Of  course  a  change 
of  output  variables  or  a  reordering  of  the  outputs  can  change  the  order  of  the 
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observability  indices.  We  could  reorder  the  outputs  so  that  £1  or  the 

reverse  but  we  shall  not  do  so.  We  shall  restrict  our  discussion  to  systems  where 
all  the  observability  indices  are  positive. 

Similarly  an  alternative  characterization  of  property  (13)  of  the  observability 
indices  is  that 

HiFe'  =  0  (14) 

mod  {HjFr~l  :  j  =  1, .. .,  p;  r  =  1,  ...,(£,+  l)  A  £>}. 

The  observability  indices  of  (4),  (5)  are  said  to  be  strict  if  (14)  holds  mod  {H  — 
jFr~l  :  j  =  l,...,p;  r  =  1, . . . ,  £»  a£j}.  The  observability  indices  are  strict  iff 
there  is  only  one  ordering  of  them  satisfying  (13).  It  is  always  possible  to  make  a 
linear  change  of  output  coordinates  y  =  iy  that  makes  the  observability  indices 
strict  for  the  new  pair  (H,  F)  =  F)  while  not  changing  their  order.  One 

way  of  accomplishing  this  is  to  define  n  x  1  vectors  Q1, . . .  ,QP  by 


HtF'~lQ’  =  |  ° 

1  <  r 

r  = 

<  u 

£1 

(15) 

and  let 

7  be  the  p  x  p  non-singular  matrix 

whose 

t  -  jth  entry  is 

7?  =  HtF(’ 

~lQ3- 

(16) 

In  this 

case  7  satisfies 

II 

j*-, 

Al 

(17) 

Moreover  7  is  the  only  such  matrix  which  makes  the  observability  indices  strict 
and  leaves  the  order  invariant  because  a  change  of  output  coordinates  y  =  py 
preserves  the  strictness  and  order  of  the  observability  indices  iff  =0  for 
£,  <  £/. 

The  controllable  form  of  a  linear  system  is 

x  =  Ax—aCx  +  Bu  (18) 

y  =  (19) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  ti, . . .  ,tm  and  a  and  7  are  arbitrary 
matrices  of  dimensions  n  x  m  and  p  x  n. 

The  following  facts  are  well-known  and/or  can  be  easily  proved.  A  system  in 
controllable  form  is  controllable  with  controllability  indices  £x , . . . ,  £m.  A  system 
(4),  (5)  can  be  transformed  into  controllable  form  (18),  (19)  by  a  linear  change 
of  state  coordinates  £  =  Tx  iff  it  is  controllable.  If  (4),  (5)  is  controllable  with 
controllability  indices  then  the  x  coordinates  of  (18),  (19)  are  defined 

by  taking  as  a  basis  the  columns  of  the  matrix  T 

T=  \Fe'-lGl,...,Gl,...,Ft"'-1Grn,...,Gm 


(20) 
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Let  x  =  T  have  components 

X  =  (in . 4lCii-"izmli-",zm(>l,)i  (21) 

where  *  denotes  transpose,  then  Fi>~rG 3  in  (  coordinates  becomes  the  unit 
vector  in  the  XjT  direction  in  x  coordinates.  The  jth  column  of  a  and  the  matrix 
7  are  given  by 


a1'  =  -T~lFe,G}  (22) 

7  =  HT.  (23) 

It  can  be  shown  that  a?r  =  0  if  t\  —  r  >  tj.  The  controllability  indices  are  strict 
iff  a\T  =  0  for  Li  —  r  >  l}-.  The  controllable  form  (18),  (19)  of  the  linear  system 
(4),  (5)  and  the  associated  x  coordinates  (20),  (21),  (22),  (23)  are  unique  up  to 
reordering  of  the  controllability  indices.  The  observable  form  of  a  linear  system 

is 


x  =  Ax—Bax+pu  (24) 

y  =  Cx  (25) 

where  ( A ,  B,C)  is  a  prime  triple  with  indices  li, . . . ,  Ip  and  a  and  P  are  arbitrary 
matrices  of  dimensions  p  x  n  and  n  X  m. 

A  system  in  observable  form  is  observable  with  observability  indices  Li,...,tp. 
A  system  (4),  (5)  can  be  transformed  into  observable  form  (24),  (25)  by  a  linear 
change  of  state  coordinates  £  =  Tx  iff  it  is  observable.  If  (4),  (5)  is  observable 
with  observability  indices  then  the  x  coordinates  of  (18),  (19)  are  of 


the  form 

x  =  (in,  •  •  •  >  xlli  >  •  •  •  i  xpU  •  •  •  i  xplf) 

(26) 

where  T~l  is  defined  by 

ui* 

i 

£ 

II 

(27) 

The  ith  row  of  a  and  the  matrix  P  are  given  by 

=  -HiFl'T 

(28) 

p  =  T~lG. 

(29) 

It  can  be  shown  that  a?r  =  0ifr>£,  +  l.  The  observability  indices  are 
strict  iff  ajr  =  0  for  r  >  (+.  The  observable  form  (24),  (25)  of  the  linear  system 
(4),  (5)  and  the  associated  x  coordinates  (26),  (27),  (28),  (29)  are  unique  up  to 
a  reordering  of  observability  indices. 
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The  controller  form  of  linear  system  is 

x  =  Ax  —  Bax  +  B0u  (30) 

y  =  Tf*  (31) 

where  (>4,  B,  C)  is  a  prime  triple  with  indices  and  a,  0,  7  are  matrices 

of  dimensions  m  x  n,  m  x  m,  p  x  n.  These  matrices  are  arbitrary  except  0  must 
be  non-singular. 

A  system  in  controller  form  is  controllable  with  controllability  indices  ti, . . . , 
im  and  the  controllability  indices  are  strict  relative  to  the  input  u  =  0u.  A 
system  (4), (5)  can  be  transformed  into  controller  form  (30),  (31)  by  a  linear 
change  of  state  coordinates  £  =  Tx  iff  it  is  controllable.  If  (4),  (5)  is  controllable 
with  controllability  indices  then  let  0  be  defined  by  (10).  One  can 

define  a  pseudo-output  for  (4),  (5). 


*=Kl  (32) 

where  K  is  the  m  x  n  matrix  defined  by  (9).  The  square  system  (4)  and  (32) 

is  observable  with  strict  observability  indices  . t,n.  The  observable  form 

realization  of  (4)  and  (32)  is  a  controller  form  realization  of  (4),  (5).  The  x 
coordinates  of  (30),  (31)  are  of  the  form  (20)  and 

*,>  =  KjFr~l£.  (33) 

The  matrix  7  is  given  by  (23)  and  the  ith  row  of  a  is  given  by 

aq  =  —KiF^T.  (34) 

Since  the  observability  indices  of  (4)  and  (32)  are  strict,  we  have 

of  =  0  r>  li.  (35) 

In  general  controller  form  realizations  are  not  unique.  However  the  controller 
form  realization  which  satisfies  (11)  and  (35)  is  unique  up  to  reordering  of  the 
controllability  indices. 

The  observer  form  of  a  linear  system  is 

x  =  Ax  —  aCx  +  0u  (36) 

y  =  7  Cx  (37) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  llt . . . ,  Ip  and  a,  0,7  are  indices  of 
dimensions  n  x  p,n  x  m,p  x  p.  These  matrices  are  arbitrary  except  that  7  must 
be  non-singular. 
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A  system  in  observer  form  is  observable  with  observability  indices 
and  the  observability  indices  are  strict  relative  to  the  output  y  =  l~ly.  A 
system  (4),  (5)  can  be  transformed  into  observer  form  (36),  (37)  by  a  linear 
change  of  state  coordinates  £  =  Tx  iff  it  is  observable.  If  (4),  (5)  is  observable 
with  observability  indices  let  q  be  given  by  (16).  One  can  define  a 

pseudo-input  n 

i  =  Ft  +  Qn  (38) 

where  Q  is  the  n  x  p  matrix  defined  by  (15).  The  square  system  (38)  and  (5)  is 

controllable  with  strict  controllability  indices  ti,...,lp.  The  controllable  form 
realization  of  (38)  and  (5)  is  an  observer  form  realization  of  (4),  (5).  The  x 
coordinates  of  (36),  (37)  are  of  the  form  (26)  and  defined  by  £  =  Tx  where 

T  =  \Fe'-1Q1,...,Q1,...,Ft’-1Ap,...,Qp\.  (39) 

The  matrix  0  is  given  by  (29)  and  the  jth  column  of  a  is  given  by 

a3  =  -T~lFe’Q3.\  (40) 

Since  the  controllability  indices  of  (38)  and  (5)  are  strict,  we  have 

°£  =  0  1  <r<U-lj.  (41) 

In  general  observer  form  realizations  are  not  unique.  However,  the  observer 
form  realization  which  satisfies  (17)  and  (41)  is  unique  up  to  a  reordering  of  the 
observability  indices. 

Remark  2.1  The  controller  form  (SO),  (SI)  of  a  system  is  very  useful  tn  de¬ 
signing  a  linear  state  variable  feedback  to  stabilize  the  system.  The  observer  form 
(S6),  (ST)  is  very  useful  in  designing  asymptotic  observers.  Together  they  can  be 
used  to  stabilize  a  system  by  dynamic  output  feedback  (also  called  observer  based 
compensation).  See  [l]  or  [ 2 ]  for  details. 

Remark  2.2  Controllable  and  observable  forms  are  easier  to  compute  and  are 
useful  for  finding  the  observer  and  controller  forms  of  related  systems. 


3  MFD’s 

The  purpose  of  this  section  is  to  emphasize  the  very  close  relationship  between 
the  normal  forms  of  a  linear  system  described  above  and  the  so-called  polynomial 
matrix  fractional  descriptions  of  its  transfer  function.  For  linear  systems  it  is 
only  a  matter  of  personal  preference  which  representation  we  choose  to  work 
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with.  This  is  not  the  case  for  the  nonlinear  systems  because  they  don’t  have  nice 
frequency  domain  descriptions.  Our  treatment  is  similar  to  that  of  (l)  and  [2]. 

Throughout  we  shall  use  the  following  notation.  Given  the  indices  £1 . £, 

where  n  =  £j  H - h  iq  then  A(s),  $(s)  and  'I'(s)  are  block  diagonal  polynomial 

matrices  of  dimensions  jxj,  q  X  n  and  n  x  q  of  the  form 

(42) 

(43) 

(44) 

The  linear  state  space  description 

x  =  Ax  +  v  (45) 

z  =  x  (46) 

with  input  v,  state  x,  output  z,  all  of  dimension  n,  has  the  following  polynomial 
matrix  description  in  the  transform  domain 

A  (-)*(«)  =  *(-)«(»)  (47) 

z(s)  =  «(,)*(.)  (48) 

where  the  so  called  “partial  state*  £(s)  is  defined  by 

^(s)  =  Cz(s)  (49) 

or  equivalently 

x(s)  =  «(a)£(a)  (50) 

(Here  x(s)  denotes  the  Laplace  transform  of  x(t),  etc.).  The  ( A,B,C )  of  the 
above  are  a  prime  triple  with  indices  ti , . . . ,  £,  so  that 

C»(a)  =  =  /***.  (51) 

From  this  we  quickly  obtain  MFD’s  corresponding  to  the  4  normal  forms  of 
the  last  section.  For  a  system  in  controllable  form  (18),  (19)  we  use  the  relations 

t>(s)  =  -qCi(j)  +  Bu(») 

y(s)  =  'yz(s). 


A(s)  =  BlockDiag  [  se'  ]lxl 
<$(s)  =  BlockDiag  [  a*"-1 


1] 


lxt, 


'I'(s)  =  BlockDiag 


1 1  i  +  i 


(52) 

(53) 
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Let  g  =  m,  then  from  (47),  (48)  and  (49),  (50),  (51)  we  obtain 


u(s)  =  (A(s)  +  $(a)oO£(a)  (54) 

y(3)  =  i*(s)tts)  (55) 

which  is  a  RMFD  of  the  form 

y(s)  =  N(s)D~1(s)u(s)  (56) 

where 

D(a)  =  (A(s)  +  $>(a)a)  (57) 

iV(s)  =  7^(3).  (58) 


Given  a  RMFD  (56)  we  can  always  obtain  a  controllable  form  realization. 
Recall  that  a  polynomial  matrix  is  unimodular  if  it  has  an  inverse  which  is  a 
polynomial  matrix.  If  we  multiply  AT(s)  and  Z?(s)  on  the  right  by  a  unimodular 
matrix  we  don’t  change  the  transfer  function.  In  this  way  we  can  insure  that  the 
matrix  of  highest  column  coefficients  of  D  is  invertible  and  even  more  equals  the 
identity. 

Let  li , . . . ,  1™  be  the  column  degrees  of  D,  then  D(s)  and  N(s)  can  be  written 
as  (57),  (58)  thus  defining  a  and  7.  This  yields  a  controllable  form  realization 
of  (56). 

For  a  system  in  controller  form  (30),  (31)  we  use  the  relations 


t/(s)  =  —  Bax(s)  +  BPu(s)  (59) 

y(s)  =  7*(s)  (60) 

and  so  we  obtain  the  RMFD  (56)  where 

D[s)  =  rl(A(>)  +  a«(«))  (61) 

N(s)  =  7^(s).  (62) 


Of  course  we  can  go  backwards.  Given  the  RMFD  (56)  we  multiply  N( 3)  and 
D(s)  on  the  right  by  a  unimodular  matrix  so  that  the  matrix  of  highest  column 
coefficients  of  D[s)  is  nonsingular.  The  decomposition  (61),  (62)  defines  a,  @ 
and  ~i  of  a  controller  realization  of  the  transfer  function. 

For  a  system  in  observable  form  (24),  (25)  we  use  the  relations 

v(s)  =  —Bax(s)  +  fiu(s) 
y(s)  =  Cz(s) 


(63) 

(64) 
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We  obtain  the  LMFD 

y(s)  =  £>~l(S)N(S)u(s)  (65) 

where 

D(s)  =  A(s)  +  a«'(s)  (66) 

N{s)  =  9(s)/3.  (67) 

On  the  other  hand  given  a  LMFD  (65)  we  can  multiply  D(a)  and  N(s )  on 
the  left  by  a  unimodular  matrix  to  obtain  the  decomposition  (66),  (67).  This 
defines  a  and  y9  of  an  observable  form  realization. 

For  a  system  in  observer  form  (36),  (37)  we  use  the  relations 


v{s)  = 

—aCx(s)  +  0u(s) 

(68) 

y(«)  = 

'lCz(s) 

(69) 

which  lead  to  a  LMFD  (65)  where 

D(s)  = 

(A  (a)  +  $(s)a)7  1 

(70) 

N(s)  = 

(71) 

Given  the  LMFD  (65)  the  decomposition  (70),  (71)  defines  a,  f)  and  of  an 
observer  form  realization. 


4  Nonlinear  Observable  and  Controller  Forms 

Henceforth  we  focus  our  attention  on  the  nonlinear  system 

«  =  /(()  +  ?(£)«  (72) 

V  =  MO  (73) 

where  f  6  R",  u  €  Rm,  y  6  Rp  and  /,  g,  h  are  smooth  (<7°°)  functions.  We 
are  interested  in  (72),  (73)  in  some  open  connected  subset  M  of  the  state  space 
containing  the  nominal  operating  point 

We  introduce  some  terminology  and  notation.  The  Lie  derivatives  of  the 
function  fi,(£)  by  the  vector  fields  /(£)  and  gJ(£)  are  functions  defined  by 


MM(fl  =  Jfiom 

(74) 

(75) 
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Of  course  these  operations  can  be  iterated, 

L}(hi)  =  LJ(L}-l(hi)).  (76) 

The  differential  dhi  of  a  function  h,-  is  a  one  form  defined  by 

dhitf  =  (77) 

A  one  form  w  is  a  row  vector  field  or  more  precisely  a  C°°  linear  combination  of 
differentials. 

«(o  =  (“ 1  (o.  ■  •  -  ,«"(0)  =  E  mcmmo  (78) 

where  fc,  (£)  and  A;(£)  are  smooth  functions.  A  one  form  can  be  paired  with  a 
vector  field  (all  vector  fields  are  columns  unless  otherwise  stated)  to  obtain  a 
function 

(«./>(€) = «(o/(o  =  £>*  (o/ao-  (79) 

»=i 

A  vector  field  can  also  Lie  differentiate  a  form  to  obtain  another  one  form 

i/H=«||  +  (| f/r  (so) 

where  *  denotes  transpose.  In  particular 

MdM  =  d(MM)-  (si) 

A  vector  field  can  also  Lie  differentiate  another  vector  field  to  yield  a  third 
vector  field. 

=  [/,*»'!  =  f£(*)/(e)  -  I|(e)^(c).  (82) 

This  can  be  iterated, 

^/)HM*'(/)4  (83) 

The  operation  (82)  is  also  called  the  Lie  bracket  (82)  of  the  vector  fields  and  can 
be  thought  of  both  as  a  multiplication  and  as  a  differentiation.  This  is  evidenced 
by  the  following  Liebnitz-type  formula  called  the  Jacobi  identity 

=  +  (84) 

Moreover  the  pairing  (79)  satisfies  a  Liebnitz  formula  with  respect  to  Lie  differ¬ 
entiation 


£/«w.  g ’))  -  <L/(w), g3)  +  <w,  1/,^]) 


(85) 
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For  the  readers  unfamiliar  with  these  concepts  we  suggest  the  calculation  of  the 
above  definitions  and  formulas  in  the  linear  case  (4),  (5)  where 


m  =  Ft 

(86) 

9*(t)  =  G> 

(87) 

MO  =  Hit- 

(88) 

We  define 

£'  =  C°°{L’f-l(dhi)  :  t  =  l r=l,. 

..,0  (89) 

where  C00{  }  means  the  linear  span  over  C°°  coefficients. 

Such  a  collection  of 

one  forms  which  is  closed  under  addition  and  multiplication  by  C°°  functions 
is  called  a  codistribution..  We  denote  by  f  *(£)  the  linear  space  of  1  x  n  vectors 
obtained  by  evaluating  the  one  forms  of  tl  at  the  point  £. 

Given  indices  , . . . ,  Lp  we  define 

£t\ . tp  ~  C°°{LTf~l(dhi)  :  i  =  1 . .  r  =  l . I  A*,}  (90) 

and  i  (t)  the  vector  space  obtained  by  evaluation  of  these  forms  at  £. 

The  system  is  (72),  (73)  has  observability  indices  £j, . . . ,  around  if  + 
■•  •  +  Lp  =  n,  and 

dimension  C "(£)  =  n  (91) 

and 

etU)  =  £t\ . ,,(0  (92) 

for  t  —  1 . n  and  all  £  in  some  neighborhood  of  £°.  The  reader  who  has  done 

the  suggested  calculations  recognizes  (91)  as  a  generalization  of  (12)  and  (92)  a 
generalization  of  (13).  The  observability  indices  are  strict  if 

Llj  (dhi)  e  £e'  (93) 

for  i  =  1, . . .  ,p.  This  generalizes  the  linear  definition. 

The  set  of  observability  indices  of  (72),  (73)  is  uniquely  determined  by  h  and 
/  and  is  invariant  under  changes  of  coordinates  in  the  state  and  output  spaces. 
There  can  be  some  freedom  in  the  ordering  of  the  indices  even  when  the  ordering 
of  the  outputs  remains  fixed.  The  observability  indices  are  strict  iff  there  is  only 
one  ordering  satisfying  (92).  To  simplify  notation  we  restrict  our  attention  to 
systems  where  all  the  observability  indices  are  positive. 

Condition  (91)  could  be  called  zero  input  observability.  It  means  that  the 
state  £( t )  of  (72),  (73)  can  be  distinguished  from  its  neighbors  by  the  output  y(t) 
and  its  first  n—  1  time  derivatives  along  the  trajectories  near  £°  corresponding  to 
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u(t)  =  0.  Unlike  the  linear  case,  (91)  does  not  imply  the  existence  of  observability 
indices  satisfying  (92)  around  every  £°  but  only  for  a  generic,  (i.e.  an  open  and 
dense)  set  of  £0,s.  The  latter  condition  implies  that  the  functions 

xir  =  Ly1(hi)(t)  (94) 

for  t  =  l,...,p;  r  =  1, are  valid  local  coordinates  on  the  state  space. 
When  (72), (73)  has  observability  indices  around  a  point  £°  which  is  a  critical 
point  of  /,  (f  (Z°)  =  0)  then  they  agree  with  the  observability  indices  of  the 
linear  approximating  system  to  (72),  (73)  at  (°. 

The  observable  form  of  a  nonlinear  system  is 

x  —  Ax  —  J3ct(x)  +  f){x)u  (95) 

y  —  Cx  (96) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  and  a,  ft  are  smooth 

m  x  1,  n  x  m  matrix  valued  functions  of  x. 

Proposition  4.1  .  A  nonlinear  system  in  observable  form  (95),  (96)  has  ob¬ 
servability  indices  t\,...,Lp.  A  nonlinear  system  (72),  (78)  can  be  transformed 
into  observable  form  (95),  (96)  by  a  change  of  local  coordinates  around  (°  iff 
the  system  has  observability  indices  around  £°.  If  (72),  (78)  has  observability 
indices  around  £°  then  the  x  coordinates  of  the  form  (26)  given  by  (94) 

transform  it  to  observable  form.  The  observable  form  of  a  nonlinear  system,  the 
associated  x  coordinates  and  the  nominal  x-operating  point  x°  =  T-1(£°)  are 
unique  up  to  a  reordering  of  the  observability  indices.  The  functions  a  and  f)  of 
the  observable  form  (95),  (96)  are  given  by 

Pi  =  LgiLyl(K). 

The  observability  index  assumption  (92)  implies  that 

dan  =  Lef(dhi)  6  f 4+1  (99) 

which  means  that  a,  does  not  depend  on  xjr  if  r  >  £,  +  1.  The  observability 
indices  are  strict  (98)  iff  a<  does  not  depend  on  a  xjr  if  r  >  £,  +  1,  »n  other 
words 

dai  =  L(f'{dhi)  e£e'.  (100) 

The  proof  of  this  result  is  relatively  straightforward,  for  example  see  [4],  section 

2. 


(97) 

(98) 
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We  now  turn  to  controllability  properties  of  (72),  (73).  We  define 

P(  =  C°°{  ad'-l(— /V:  j  =  1, ...  ,m;  r=l,...,L}.  (101) 

This  is  a  collection  of  vector  fields  closed  under  addition  and  multiplication  by 

C°°  functions;  such  an  object  is  called  a  distribution.  Given  indices 

let 

. ,m  =  {ad  r~l(-fW:  j  =  1 . .  r  A  l,}.  (102) 


The  system  (72),  (73)  has  controllability  indices  li,.. 

■  ■  ■  +  lm  =  n  and 

. ,  Lm  around  £°  if  Ly  + 

dimension  Pn(£)  =  n 

(103) 

and 

»*{!)  =  Pe\ . *Jf) 

(104) 

for  t  =  l,...,n  and  all  £  in  some  neighborhood  of  £°. 

Of  course  (103)  is  a 

generalization  of  (6)  and  (104)  is  a  generalization  of  (7).  The  controllability 

indices  are  strict  if 

«**'(-/)✓'  €  P&....€m 

(105) 

for  i  =  1 . n.  This  generalizes  the  linear  definition. 

The  set  of  controllability  indices  of  (72),  (73)  is  uniquely  determined  by  /  and 
g  and  is  invariant  under  change  of  coordinates  in  the  state  space  and  nonlinear 
state  feedback,  i.e.  u  =  a(x)  4-  P(x)\>  where  p(x)  is  m  X  m  invertible.  There 
can  be  some  freedom  in  the  ordering  of  the  indices  even  when  the  output  is 
fixed.  The  controllability  indices  are  strict  iff  there  is  only  one  ordering  satisfying 
(104).  For  notational  convenience,  we  restrict  our  attention  to  systems  where 
the  controllability  indices  are  positive. 

Condition  ( 103)  could  be  called  local  linear  controllability  for  if  £°  is  a  critical 
point  of  /,  (/(£°)  =  0)  then  the  linear  approximation  to  (72),  (73)  at  is 
controllable  iff  (103)  holds.  Once  again  (103)  does  not  imply  the  existence  of 
controllability  indices  satisfying  (104)  around  every  £°,  only  for  an  open,  dense 
set  of  £°’s.  When  (72),  (73)  has  controllability  indices  around  a  critical  point  £°, 
they  agree  with  the  controllability  indices  of  the  linear  approximating  system  to 
(72),  (73)  at  £°. 

The  controller  form  of  a  nonlinear  system  is 

x  —  Ax  -  Ba(x)  +  BP(x)u  (106) 

V  =  l(x)  (107) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  , . . . ,  C.  and  a,  0}  7  are  smooth 
mxl,  m  x  m,  p  x  1  matrix  valued  functions  of  x  which  are  arbitrary  except 
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that  fl(x)  must  be  nonsingular.  The  question  of  when  a  nonlinear  control  sys¬ 
tem  can  be  transformed  to  controller  form  has  been  independently  solved  by 
several  authors  [5,6,7,8,9,22).  Some  only  considered  special  cases  like  m  =  1  or 
ft(x)  =constant.  Our  treatment  follows  Hunt  and  Su  |8). 

Recall  that  a  distribution  P  is  involuting  if  it  is  closed  with  respect  to  Lie 
bracket,  i.e.  [tf1,?2)  €  D  whenever  ql,q2  €  P.  Given  a  distribution  we  can 
consider  the  under-determined  systems  of  partial  differential  equations. 


(dk,  q)  =  0  forall  q  &  P 


(108) 


for  the  unknown  function  fc(£).  The  question  of  existence  and  uniqueness  of  local 
solutions  to  (73)  is  addressed  by  the  following. 

Frobenius  Theorem  Suppose  P  is  of  constant  codimension  d.  P  is  involutive 
iff  locally  there  exists  d  independent  solutions  kit...,kj  to  (73).  Any  other 
solution  k(£)  is  a  function  of  ki(^), . . . ,  kd(£)- 

Proposition  4.2  ([8],  see  also  [5,6,7,8,9,S£]).  A  nonlinear  system  in  controller 
form  (106),  (107)  has  controllability  indices  £1,...,4n  which  are  strict  relative 
to  the  input  u  =  /?u.  A  nonlinear  system  (7£),  (73)  can  be  transformed  into 
controller  form  (106),  (107)  by  a  local  change  of  coordinates  around  (°  iff  it  has 
controllability  indices  ilt . . . ,  tm  and  Pt>~1  is  involutive  for  j  —  1, . . . ,  m. 

Proof  The  proof  of  the  first  statement  is  a  straightforward  verification. 

As  for  the  second  suppose  (72),  (73)  can  be  transformed  to  controller  form 
by  £  =  T(x).  Using  the  C  matrix  of  the  prime  triple  we  define  a  pseudo-output. 


'l’  =  k(t)  =  CT-1(l;)  (109) 

then  the  function  k  satisfies 


Lg,L 


1  <  r  <  U 


(110) 

(111) 


Using  the  Liebnitz  formula  (85)  and  induction  we  can  show  that  (111)  is  equiv¬ 
alent  to 


(Lj(dki),adr~,~1(—f)g*)  =  | 


1  <  r  <  A, 

r  =  U, 


0  <  s  <  r 
0  <  s  <  r 


(112) 


FYom  this  it  follows  that  for  every  q  €  Pe 


{Ly'(dki),q)=  0 


(113) 
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for  i  =  1, . . .  ,n  and  r  =  1  —  t.  Moreover  from  the  invertibility  of  ft  it 

follows  that  the  functions  t  =  and  r  =  1  —  1}  are 

independent.  There  are  as  many  such  functions  as  the  codimension  of  Pl  so 
by  the  FVobenius  theorem  Pl  is  involutive  for  all  l  and  in  particular  for  l  = 

tj  1,  J  1,  .  .  .  ,  TYl. 

On  the  other  hand  if  Pe>  ~l  is  involutive  for  j  =  1, . . . ,  m  then  by  repeated  ap¬ 
plication  of  the  FVobenius  theorem  one  can  find  independent  functions  fci , . . . ,  km 
satisfying  (112)  where  ft  is  some  invertible  m  x  m  matrix  valued  function.  If  we 
define  x  coordinates  by 

X,  r  =  Lr/~l(kJ)(()  (114) 

for  j  =  1, . . . ,  m;  r  =  1, . . . ,  tj  then  these  coordinates  transform  the  nonlinear 
system  to  controller  form  (106),  (107).  The  functions  a  and  ft  are  given  by  ( 1 10) , 
(111). 

When  it  exists,  the  controller  form  of  a  nonlinear  system  is  not  unique.  FVom 
the  proof  of  the  above  we  see  that  the  controller  form  is  completely  determined 
by  the  choice  of  the  pseudo-output  A:(£)  satisfying  (ill)  for  some  invertible  /?(£). 
If  k(£)  is  another  solution  of  (ill)  then  (112)  and  ^  1 13)  imply  that  fc<(£)  is  a 
function  of  xyr  =  Lrf~i(kj)  for  tj  >  and  r  =  1, . . . ,  Lj  —  t\. 

Notice  that  the  nominal  operating  point  x°  =  T~l(£°)  of  the  controller  form 
is  determined  by  the  choice  of  k(£).  In  particular  there  exists  k  such  that  x°  =  0 
iff  there  exists  u°  such  that  /(£°)  4-  <?(£°)u°  =  0. 

Another  point  worth  mentioning  is  that  the  system  (72)  with  pseudo-output 
=  A:(£)  does  not  necessarily  have  observability  indices  This  would 

be  the  case  iff  in  addition  to  (112),  fc(£)  satisfies 

(Lji(dki),*dr~1(—f)g1)  =  0 

for  r  =  1,. ..  ,£y  -  ti  -  1. 

We  might  try  to  obtain  a  unique  controller  form  by  requiring  that  a  and  ft 
also  satisfy  the  nonlinear  generalizations  of  (ll)  and  (35),  namely 

ftHt)  =  S(  (115) 


doc{ 

dx}r 


=  0 


r>ti 


(116) 


But  this  would  reduce  the  number  of  nonlinear  systems  that  admit  a  controller 
form.  The  conditions  (115),  (116)  imply  that 


(«tti,«r-,(-/)/}  =  |  ° 


1  <  r  <  tj 
r  =  tj 


(117) 
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This  is  a  system  of  first  order  partial  differential  equations  for  the  unknown  func¬ 
tions  Jfci,  The  solvability  of  such  a  system  is  addressed  by  the  following. 

Integrability  Theorem  Let  91($)>  •  •  ■  >?"(£)  be  an  n  linearly  independent  n 
dimensional  vector  fields.  There  exists  a  solution  k  =  (fci, . . . ,  km)  to  the  system 


(dki 


V)«/  6i  =  — 

\  0  j  =  m  +  1, ...  ,n 


iff 

[q*,?5]  G  P  i,j  =  l,...,n 

where  D  is  the  distribution  spanned  by  {<jm+1 , . . . ,  ?n}.  The  solution  is  unique 
up  to  a  choice  of  fc(|°). 

From  this  theorem  we  see  that  there  exists  a  solution  to  (117)  iff 

l«dr-1(-/)ffi1  ad  -li-f)g*]el>  (118) 

for  t,y  =  l,...,n;  r  =  1 «  =  1  ,...,£/  where  P  is  the  distribution  given 
by 

P  —  C°°{adr-1(— Z)^  :  j  =  1,. r  =  1, . - . t/y  —  1>.  (119) 

Condition  (118),(119)  is  considerably  more  stringent  then  Pti~i  being  invo- 
lutive  for  j  =  1, . . . ,  m.  In  particular  suppose  we  consider  a  generic  nonlinear 
system  (72),  (73)  with  n  —  2  and  m  =  1.  Around  a  generic  point  £°,  the  vector 
fields  g1  and  ad (-f)g1  are  linear  independent  hence  such  a  system  has  a  con¬ 
trollability  index  t\  ~  2.  The  distribution  Px  —  ^{j1}  is  trivially  invohitive 
so  such  a  system  has  a  controller  form.  However  condition  (118),  (119)  which  in 
this  case  is 

\g\  ^(-f)g1)  6  C-fo1} 

is  not  genetically  satisfied. 

Suppose  (72),  (73)  has  controllability  indices  around  £°.  Regard¬ 

less  of  whether  or  not  it  admits  a  controller  form  around  £°,  it  is  always  pos¬ 
sible  to  make  the  controllability  indices  strict  by  a  change  of  input  coordinates 
u  =  /?(£)u  when  0* (£)  =  6-  for  Li  <  l3  as  in  the  linear  case.  We  define  one  forms 
“i(f).---,Wp(0  by 


(o,i,ad-1(-/)^)  =  {  °  1^L<titi  (120) 

FVom  this  and  the  controllability  index  assumptions  (104)  it  follows  that 
(w ».  adr_1(-/)^)  =0  tj  <r<  Li. 


(121) 
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Moreover  by  repeated  use  of  the  Liebnitz  formula  (85)  we  see  that  (120),  (121) 
is  equivalent  to 


(L 


1  <  r  <  tj  or  tj  <  r  <  U 
r  =  /y 


(122) 


We  define  ff  by 

#  =  <!,*- VO.  A  (123) 

Immediately  we  see  that  /?/  =  6-  for  U  <  l}  so  @  is  invertible.  It  is  not  hard 
to  show  that  the  system  defined  by  (/,  g)  =  (/,  gP~l)  has  strict  controllability 
indices  tu . . . ,  tm.  Notice  that  if  (117)  is  solvable  then  w,  =  dfc,. 


5  Nonlinear  Controllable  and  Observer  Forms 

The  controllable  form  of  a  nonlinear  system  is 

x  =  Ax  -f  a(Cx)  +  Bu  (124) 

y  =  7(z)  (125) 

where  (A,  B,  C)  is  a  prime  triple  with  indices  ti ,  lm  and  a,  7  are  smooth  matrix 
valued  functions  of  dimensions  n  X  1,  p  x  1.  Notice  that  a  is  a  function  of  the 
pseudo-output  ip  =  Cx  while  7  is  a  function  of  z. 

Notice  that  if  a(tp)  is  a  linear  function  of  ip  then  the  dynamics  (124)  of  the 
nonlinear  controllable  form  agree  with  the  dynamics  (18)  of  the  linear  control¬ 
lable  form.  Hence  the  question  of  the  existence  of  a  nonlinear  controllable  form 
is  closely  related  to  the  question  of  linearising  the  dynamics  (124)  by  a  change  of 
state  coordinates.  This  latter  question  has  a  long  history  going  back  to  Poincare 
[16],  For  more  recent  work  see  (17,18,19,20,21). 

For  the  most  part  the  controllable  forms  of  nonlinear  systems  have  not  ap¬ 
peared  explicitly  in  the  literature.  But  as  one  might  expect  they  have  arisen 
implicitly  in  some  of  the  work  on  observer  form  [4,10],  and  on  linearisation  [2 1). 
The  following  is  a  reformulation  of  similar  results  from  [10],  [21]  and  [22]. 

Proposition  5.1  A  nonlinear  system  »'n  controllable  form  has  controllability  in¬ 
dices  A  nonlinear  system  (72),  (73)  can  be  transformed  into  con¬ 

trollable  form  (124),  (185)  by  a  change  of  local  coordinates  around  £°  iff  it  has 
controllability  indices  l\, . . . ,  tm  and 

[adr-1(-/)yt,  ad"l(-/)^|  =  0  (126) 

for  i,j=  1 . m  and  r  =  1 . ti,  s  =  1, ...,<y  around  £°.  Controllable  form 

and  the  associated  x  coordinates  are  unique  up  to  a  choice  of  the  nominal  x- 
operating  point  x°  =  T~ 1  (£°)  and  up  to  reordering  of  the  controllability  indices. 
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The  dynamics  (72)  of  a  nonlinear  system  can  be  linearized,  or  equivalently, 
can  be  transformed  to  the  dynamics  of  linear  controllable  form  (18)  by  a  change 
of  state  coordinates  around  £°  iff  (72)  has  controllability  indices  and 

(126)  holds  for  i,j  =  and  r  —  1, . . . ,U  +  1;  s  =  1,...,/,  +  1. 

Proof  Consider  the  nonlinear  system  in  controllable  form  (124),  (125).  It  is  a 
straightforward  calculation  to  show  that 

adr-l(-Ax  +  a{^))B^  =  {  lrZr,-+{  ■  (127) 

l  av-y  r  -C;  -t- 1 

Hence  the  controllable  form  (124),  (125)  has  controllability  indices 
Moreover  if  (72),  (73)  can  be  transformed  to  (124),  (125)  by  a  change  of  state 
coordinates  then  clearly  (72),  (73)  must  have  the  same  controllability  indices 
and  (126)  must  hold. 

On  the  other  hand  suppose  (72),  (73)  has  controllability  indices 
and  (126)  holds.  By  the  integrability  theorem  of  Section  4  with  m  =  n,  we  can 
choose  coordinate  functions  it>(£),  * =  1, . . . ,  m;  r  =  1, . . . ,  U  such  that 

(dxir,  (— f)g1)  =  6(6)  (128) 

for  i,j  =  l,...,m  and  r  =  1,..  .,4,  s  = 

In  the  x  coordinates,  ad<J-*(— f)g>  becomes  the  unit  vector  in  the  direction 
Xj,  or  in  other  words 

dx 

— ad'>~'(-/)^  IP  (129) 

for  j  =  1 . m  and  s  =  1, . . . ,  tj,  where  A,  B  are  from  the  prime  triple  with 

indices  Zif. Let  f(x)  be  the  transform  of  /(£)  into  *  coordinates. 

/(*>  =  ||(«(*))/(«*)) 

Then  from  (129),  (130)  we  have  if  s  >  1 


(130) 
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_  t B3 


FVom  this  we  conclude  that 


(131) 


]{x)  =  Ax  +  a(Cx) 
where  CiX  -  Xu,  i  =  1, . . . ,  m. 

We  now  prove  the  last  part  of  the  theorem.  If  the  nonlinear  dynamics  (72) 
can  be  transformed  to  the  dynamics  (18)  of  linear  controllable  form  then  by 
the  above  it  must  have  controllability  indices  . . . ,  Cm  and  (126)  must  hold  for 
r  =  1  and  s  =  1  Moreover  we  see  from  (127)  that  ad t’{—f)g3 

must  transform  to  a  constant  vector  field  in  z  coordinates  so  (126)  must  hold  for 
r  =  1, . . . ,  U  +  1  and  s  =  1, . . . ,  lj  +  1. 

On  the  other  hand  if  (126)  holds  for  r  =  1 . t+  +  1  and  a  =  1, . . . ,  lj +  1 

then  ad tk(—f)gk  must  be  a  constant  linear  combination  of  the  frame  of  vector 
fields  {adr_1(— f)g',  t  =  l,...,m;  r  =  1, To  see  this,  suppose  for  some 
functions  Ajr(£) 

ad^-Z^EEad'-1  (-/)«,*  A*r. 

t=l r=l 

Bracketing  with  ad*_1(— /)<?  yields 

♦=1 r=l 

The  linear  independence  of  the  vector  fields  of  the  frame  implies  that  for  j  = 
1, . . . ,  m;  a  =  1, . . . ,  lj 

hence  A*r  is  a  constant.  By  (127)  this  implies  that 

m 

<*«>( VQ  =  E  QED. 

k=l 

Notice  that  it  is  more  difficult  for  a  nonlinear  system  (72),  (73)  to  have 
a  controllable  form  than  to  have  a  controller  form.  Clearly  conditions  (126) 
implies  that  Vl’~l  is  involutive  for  j  =  1, . . . ,  m.  This  extra  difficulty  is  partially 
explained  by  the  extra  freedom  afforded  by  p(x)  in  the  controller  form  which  is 
lacking  in  the  controllable  form.  Zeitz  defines  controllable  form  with  jd(x)  present 
(22).  There  is  also  more  freedom  in  the  a  of  the  controller  form  than  the  a  of  the 
controllable  form.  The  former  is  an  Rm  valued  function  of  R”  while  the  latter 
is  a  Rn  valued  function  of  Rm.  The  linear  terms  of  the  Taylor  series  expansion 
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have  the  same  number  of  degrees  of  freedom,  n  m,  but  there  are  more  degrees 
of  freedom  in  the  higher  order  terms  of  the  controller  form  that  the  controllable 
form.  In  particular  for  the  terms  of  order  2,  there  are  mn(n  +  l)/2  degrees  of 
freedom  in  the  former  and  nm(m  +  l)/2  in  the  latter. 

The  observer  form  of  a  nonlinear  system  is 

x  =  Ax  —  a(Cx)  +  ft(Cx)u  (132) 

y  =  i(Cx)  (133) 

where  (A,  B,C)  is  a  prime  triple  with  indices  and  a,  f)  and  7  are 

smooth  matrix  valued  functions  of  dimensions  n  X  1,  m  X  m  and  p  x  1.  They 
are  arbitrary  except  that  7  must  be  a  local  diffeomorphism.  We  let  y  —  Cx. 
Observer  form  is  useful  in  the  construction  of  asymptotic  observers 

x  =  Ax  +  a(y)  +  p(y)u  + M(y  -  Cx)  (134) 

with  linear  error  dynamics 

x  =  (A  -  MC)x.  (135) 

The  question  of  when  a  nonlinear  system  can  be  transformed  to  observer  form 
has  been  considered  by  several  authors  [4],  [10,11,12,13,14],  [22],  Most  treated 
only  special  cases  like  p  =  1  or  7  =identity.  The  general  solution  can  be  found 
in  [4].  The  approach  taken  in  [4]  is  similar  to  the  approach  described  above  for 
the  linear  case. 

Suppose  the  nonlinear  system  (72),  (73)  can  be  transformed  into  observer 
form  (132),  (133)  by  a  local  change  of  coordinates  around  £°.  Using  the  B 
matrix  of  the  prime  triple  we  add  a  pseudo-input  p  to  (132) 

x  =  Ax  —  a(Cx)  +  0(Cx)u  +  1 3/x.  (136) 

When  u  is  held  constant  at  0,  (136)  can  be  viewed  as  the  controllable  form 
relative  to  the  pseudo-input  /i. 

We  transform  (136)  back  to  ^-coordinates 

£  =  /(£)  +  s(£)u  +  <?(£)m 

which  defines  the  vector  fields  q  =  ql , . . . ,  q9.  These  vector  fields  satisfy 

{Lylm,  *»■)  =  {  °  •  (137) 

If  q  is  known  then  wc  can  recover  the  observer  form  by  choosing  local  coordinates 
Xir  to  satisfy 

(dxir,  ad €'"(— /)*»')  =  6>S: 


(138) 
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for  t,j  =  1, . . .  ,p,  r  =  1 . ti  and  s  =  1 Such  coordinates  exist  iff 


(ad*'- 

ad^-*(-/)^]=0 

(139) 

1,... 

i  j  9  1|  •  •  •  |  and 

(ad*’_r(— /)^*  ,g*\  =  0 

(140) 

for  t  =  1, . . .  ,p;  j  =  1, . . .  ,n  and  r  =  2 

Summarizing  the  discussion,  an  observer  form  (132,133)  of  (72),  (73)  exists 
iff  there  exists  a  change  of  coordinates  y  —  q(p)  on  the  output  space  and  vector 
fields  ql determined  by  q  via  (137)  such  that  (139), (140)  holds.  In  effect 
(139), (140)  constitute  an  overdetermined  system  of  partial  differential  equations 
for  the  change  of  coordinates  y  =  q(y)  on  the  output  space.  To  analyze  such 
equations  we  must  introduce  the  geometric  concept  of  a  Koszul  connection  on 
the  output  space.  Let  ^‘(y),t  =  1,2, .. .  denote  vector  fields  on  the  p  dimensional 
output  space.  A  Koszul  connection  on  y-space  is  a  mapping  A  from  pairs  of  such 
vector  fields  to  vector  fields. 


A:(*‘,^)~  A^>)  (141) 

This  mapping  is  linear  over  C°°  functions  in  the  first  argument  and  satisfies  a 
Liebnitz  formula  in  the  second  argument.  In  other  words  if  A ,(y)  and  fiy(y)  are 
smooth  functions  then 

=  £(A.m,A*<(^)  +  A  iLfbijW).  (142) 

i  *./ 

If  4>l(y),  ■  •  ■  i  ^p(y)  is  a  local  frame  of  vector  fields  then  A  is  completely  deter¬ 
mined  by  its  Christoffel  symbols  r]/(y)  relative  to  this  frame.  These  are  defined 
by  the  expressions 

V(^)  =  IIrfc4k-  (143) 

k 

Consider  a  second  frame  4>l , . . . ,  4>p  related  to  the  first  by 

(144) 

v=i 

and 

4>r  =  't*kyk 

k-  1 


(145) 
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where  A  =  (A?)  is  a  pxp  nonsingular  matrix  valued  function  of  y  and  A-1  =  (/r?). 
It  follows  from  (141,142)  (143)  and  (144,  145)  that 

W  =  E  MWr  +  E^VU)-  (146) 

P,0>T  PS 

A  Koszul  connection  A  has  zero  curvature  if  there  exists  a  frame  where  the 
Christoffel  symbols  are  zero.  From  equation  (146)  we  obtain  the  partial  differ¬ 
ential  equation  that  such  a  change  of  frame  must  satisfy.  It  is  more  conveniently 
written  in  matrix  notation  where  Tp  denotes  the  pxp  matrix  (Fp<7)  with  row 
index  r  and  column  index  o, 

O^r'A-'-h  V(A-1)  (147) 

or  equivalently 

V(A)  =  AP\  (148) 

The  integrability  condition  for  this  is 

L ^  (A)  —  (A)  -  Z,j^<ri0P](A)  —  0  (149) 

or  equivalently 

A(rpr"  -  rrp  +  v  (r*)  -  v  (rp)  -  E  <£prT)  =  0  (iso) 

r 

where  Cpp  are  the  structural  coefficients  of  the  frame 

l^,^)  =  EC'r^T-  (151) 

T 

The  coefficient  of  A  in  (150)  is  the  curvature  of  A. 

It  is  convenient  to  work  with  frames  of  vector  fields  arising  from  coordinates 
on  the  output  space.  Suppose  y  and  y  are  two  different  coordinate  systems  and 
<f>  and  ^  are  the  associated  frames,  i.e. 


These  frames  are  related  by  the  chain  rule 


(152) 


SO 


A  =  dij/dy. 


(154) 
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A  Koszul  connection  A  is  flat  if  there  exists  a  coordinate  frame  for  which  the 
Christoffel  symbols  are  sero.  Such  coordinates  are  said  to  be  flat  relative  to  the 
connection.  Suppose  rf®  are  the  Christoffel  symbols  relative  to  coordinates  y. 
Clearly  we  can  find  new  coordinates  y  where  Christoffel  symbols  are  zero  iff  we 
can  solve  the  pair  of  partial  differential  equations  (147,148)  and  (154). 

We  rewrite  these  as 

/-A  =  Ar  (155) 

9y. 

~  =  A.  (156) 

9y 

The  integrability  condition  for  the  first  is  the  zero  curvature  condition  (150) 
which  can  be  rewritten  as 

npi  or; 

rjr  -  r  r>  +  2 —  —  =  o  (157) 

oyy  <3y. 

for  t,  j  —  1, . . .  ,p.  The  integrability  condition  for  the  second  is 

if  “  rk  =  °-  (158) 

The  left  side  of  (158)  is  called  the  torsion  of  the  connection  A.  In  summary 
a  Koszul  connection  is  flat  (i.e.  has  Christoffel  symbols  zero  relative  to  some 
coordinate  frame)  iff  it  has  zero  curvature  (157)  and  zero  torsion  (158). 

Suppose  $  are  flat  coordinates  for  a  flat  connection  A.  It  follows  from 
(155,156)  that  another  set  of  coordinates  y  is  flat  iff  y  and  y  are  affinely  re¬ 
lated,  i.e.  for  some  constant  invertible  matrix  A 

y=  Ay  +  y°. 

The  relevance  of  the  above  for  the  problem  of  transforming  a  system  to 
observer  form  is  explained  by  the  following  lemmas. 

Lemma  5.1  Suppose  the  nonlinear  system  (72, IS)  has  one  distinct  observability 
index  t  =  ti  =  ...  =  lp  of  multiplicity  p.  Define  vector  fields  ql , . . . ,  <f  by 

(LJ-1(</y<),^)={  °  l-rl~tjU  (159) 

Define  p3  functions  r^J(£)  by 

If  =  J(L/(dyfc),[ad<-1(-/)v’,ad?-2(-/)?’]). 


(160) 
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Let  y  —  y(y)  be  a  change  of  output  coordinates  and  Q  =  Q1 be  vector  fields 
defined  by  (1ST).  Define  another  p3  function  ry(f)  by 

rjf  =  -t(Lf(d9k),  [ad^1(-/)3*,ade-2(-/)^]>.  (161) 

Then  r^J  and  t])  are  related  by 

fit  =  v  dy?  dy°  d9*  roo  ,  y  dyp d &  9  , dv * 

*  e^r  dy<  dy> dyr  r  iz  d9i  dyr  ay»  d9> 

The  proof  of  this  lemma  can  be  found  in  [15].  Notice  that  the  lemma  asserts 
that  17  transform  like  the  Christoffel  symbols  of  a  connection  on  the  output 
space,  not  that  they  are  Christoffel  symbols.  If  (f )  are  actually  only  functions 
of  y  then  they  define  a  connection  on  the  output  space  and  this  connection  is 
independent  of  the  choice  of  output  coordinates. 

Lemma  5.2  Suppose  the  nonlinear  system  (72,73)  has  one  distinct  observability 
index  t  —  li  =  ...  =  Ip  and  can  be  transformed  to  observer  form  (132,188)  then 
17  (£)  defined  by  (160)  are  functions  only  of  y  and  define  a  flat  connection  on 
the  output  space. 

Proof  We  compute  the  symbols  given  by  (161)  where  5  =  Cx  are  the  trans¬ 
formed  output  coordinates  of  the  observer  form.  The  vector  fields  5(f)  defined 
by  (143)  transform  to  B  in  x  coordinates.  By  induction  we  obtain 


f  Ar~1Bi  1  <  r  <  l 
ad  1(-A*  +  a(5))Bi  =  j  r  =  l+ 1 

(163) 

FYom 

[ad^-Ax  +  a($))B\  ad  l~2(-Ax  +  <*(s))B>]  = 

[Ae~1Bi,  At~2B1]  =  0. 

(164) 

It  follows  that 

r* 

II 

p 

(165) 

If  If  are  defined  by  (160)  then  (162)  and  (165)  shows  that  they  are  functions 
of  y  alone  and  can  be  transformed  to  zero  by  a  change  of  output  coordinates. 
Hence  they  define  a  flat  connection  on  the  output  space.  QED. 

FYom  these  lemmas  we  immediately  obtain  the  following  theorem  [15]. 

Theorem  1  Suppose  the  nonlinear  system  (72,73)  has  one  distinct  observability 
index  l  =  ti  =  . . .  =  tp  around  f°.  It  can  be  transformed  to  observer  form  around 

e  ^ 
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the  17  (£)  defined  by  (160)  are  functions  only  of  y,  hence  define  a  Koszul 
connection  on  the  output  space 

this  connection  is  fiat 

for  any  flat  coordinates  y  on  the  output  space  the  vector  fields  defined  by 
(1S7)  satisfy  the  commutative  conditions  (139,140). 

Consider  a  system  with  one  distinct  observability  index  l  =  £1  =  . . .  =  Ip 
which  is  in  nonlinear  observable  form,  i.e. 

*'“(*<0  r=t  <166> 

for  t  =  1, . . . ,  p  and  r  =  1, . . . ,  t. 

The  vector  yields  g1 , . . . ,  cf  defined  by  (159)  are  just  the  unit  vectors  in  the 
directions  £1  The  TJ?  defined  by  (160)  are  given  by 


It- 


-1 

l 


_d _ d_ 

dij2  d&t 


/*(«■ 


(167) 


The  change  of  output  coordinates  y  =  q_1  (y)  must  satisfy  the  partial  differential 
equations  (155,156)  or 

a  ,agP.  _  zl  y'  dyp  92  f* 

dyidy/  t  ^  dyk  dij2diit' 

The  integrability  conditions  for  this  are  the  zero  curvature  condition  (157)  or 

v~*  a*  u  a2/p 

P  dipldtil  d£jt 

<(— g& _ 

and  the  sero  torsion  condition  (158)  or 

d2fk  = _ 

d(y3dtn  d(,3d(je- 

If  these  are  satisfied  then  we  can  solve  (168).  If  (139,140)  are  satisfied  then  we 
can  solve  (138)  to  find  the  1  coordinates  of  observer  form. 

Needless  to  say  this  is  a  very  tedious  process.  There  is  a  necessary  condition 
that  a  system  in  nonlinear  observable  form  (166)  must  satisfy  to  be  transformable 
to  observer  form.  We  define  the  degree  of  the  variable  &r  to  r  -  1  and  the  degree 


d2U  a2/, 


£ _  _ 


d£p2d£it  dtridZii 

d2  fa  \ 


aindir^dijt 

a2fk 


(169) 


/ 1  nr\\ 
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of  a  product  of  such  variables  to  be  the  sum  of  the  degrees  of  its  factors.  If  (166) 
can  be  transformed  into  observer  form  then  /,  (£)  must  be  a  polynomial  of  degree 
at  most  l.  We  refer  the  reader  to  [4]  for  a  proof  of  this.  In  particular  if  l  =  2 
then  this  degree  condition,  the  zero  curvature  condition  (169)  and  (139,  140)  are 
necessary  and  sufficient.  The  torsion  free  condition  (170)  is  trivially  satisfied.  It 
follows  from  (169)  that  (139)  need  only  be  checked  for  r  =  s  =  1  and  i  ^  j. 

If  p  =  1  then  trivially  the  curvature  and  torsion  are  zero  and  (168)  reduces 
to  a  first  order  linear  ordinary  differential  equation  for  the  quantity  dy/dy.  It 
is  solvable  if  the  degree  condition  on  /i  is  satisfied.  In  particular  when  p  =  1 
and  1  =  2  the  degree  condition  and  (140)  are  necessary  and  sufficient  for  the 
existence  of  observer  form. 

We  now  discuss  the  case  where  there  are  several  distinct  observability  indices. 
The  general  approach  is  as  before.  To  find  the  observer  form  of  (72,73)  if  it  exists 
we  seek  an  appropriate  change  of  output  coordinates  y  =  7_1(y)  which  allows 
us  to  define  vector  fields  q  via  (137).  If  (139,140)  are  satisfied  then  we  can  solve 
(138)  for  the  x  coordinates  of  the  observer  form. 

The  presence  of  several  distinct  observability  indices  complicates  the  search 
for  y  and  forces  us  to  proceed  in  stages.  Notice  that  for  a  system  in  observer 
form  the  observability  indices  are  strict  for  the  output  y  =  Cx.  This  is  because 

L\-Al-aM)(dy)  =  CA'-'  modf-1 

and  the  output  indices  are  strict  for  the  pair  C,  A.  So  any  nonlinear  system  that 
admits  an  observer  form  must  admit  a  change  of  output  coordinates  which  make 
the  output  indices  strict.  Moreover,  the  problem  of  transforming  a  nonlinear 
system  with  strict  observability  indices  into  observer  form  is  greatly  simplified 
by  the  following  fact. 

A  change  of  output  coordinates  $  =  •y~1(y)  preserves  the  order  and  strictness 
of  the  observability  indices  iff 


dy, 

dy} 


=  0  for  £,  <  lj. 


(171) 


To  find  a  change  of  output  coordinates  which  make  the  observability  indices 
of  (72,73)  strict  we  start  by  defining  vector  fields  q1,...,qp  via  (159). 

It  follows  by  the  standard  induction  argument  using  the  Liebnitz  formula 
(85)  that  (143)  implies 


(dyi,  adr~l  (-f)q3) 


0  1  <r<U 

PI  r  =  L, 

0  Li  <  r  <  t} 


(172) 
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Moreover  the  vector  fields  adr  1  {—f)q}  j  =  1, ...  ,p;  r  —  1, . . . ,  l}  form  a  frame 
of  n  independent  vector  fields.  These  characterize  £ e  as 

£e  =  {adr~1(-/)q’  :  j  =  l . .  r  =  1, . . .  ,t}  -  t}  1 

£ 1  —  {oneforms  u>  :  («,  adr-1(— f)<?)  =  0  j '=  1, . . . ,  p;  r  =  1, — 4}. 

(173) 

Suppose  y  =  Tf  1  (y)  is  a  change  of  output  coordinates  which  preserves  the 
ordering  of  the  observability  indices.  The  observability  indices  are  strict  relative 
to  the  y  output  iff 

L^(dyi)e£t’  (174) 

or  equivalently  by  (173) 

<l£((£ft)adr-1(-/)ff,‘>=  0  (175) 

for  r  =  1, . . . ,  I,  —  ti.  By  induction  and  the  Liebnitz  formula  this  is  equivalent 
to 

<dfc,ad'-1(-/)?y)=0  (176) 

for  r  =  tj.  Since  dy  =  di)/dy  dy,  (172)  implies  that  (176)  must  hold 

for  r  =  1, . . . ,  tj  —  1  also.  We  have  shown  that  the  observability  indices  are  strict 
relative  to  the  output  y  iff  (176)  holds  for  r  =  1, . . .  ,l}-  —  1  when  tj  <  and  for 
r  =  1  when  t,  >  U- 

We  define  p  distributions 

yi  =  C°°{adr-1(— /)g*  :  r  =  1,. . .  ,tj  -  1  if  t}  <  U  and 

(177) 

As  we  have  just  seen  a  change  of  coordinates  y  =  i~1(y)  preserves  the  ordering 
of  the  observability  indices  and  makes  them  strict  iff 

dgi±yii=  1 . .  (178) 

This  is  an  underdetermined  system  of  first  order  PDE’s  for  y.  By  employing  the 
FYobenius  Theorem,  we  obtain  the  following  reformulation  of  Theorem  4.2  in  [4]. 

Proposition  5.2  Suppose  the  nonlinear  system  (72,73)  has  observability  indices 

1 . . tp  around  £°.  There  exists  a  local  change  of  output  coordinates  y  — 

1  (y)  which  preserves  the  ordering  of  the  observability  indices  and  makes  them 
strict  iff  the  distributions  yl,...,yp  are  involutive. 

Lemma  5.S  Suppose  the  nonlinear  system  has  strict  observability  indices  llt . . . , 
tp  and  l  =  min{£i, . . . ,  tp).  Define  vector  fields  q  by  (159)  and  symbols  by 
(160)  then 

r'fcy  =  0  if  ^  >  t  or  tk  >  l. 
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Proof  Equation  (160)  can  be  rewritten  as 

“  ^ad*-,(-/)*i-t«d«-1(-/)*<^/(!fc)-  (179) 

By  (159) 

^ad 1{yk)  =  |  g£  ^  =  / 
so  the  first  term  on  the  right  of  (179)  is  always  zero.  If  As  >  i  then 

=  {  s’  £  Vt  + 1 


so  r*fcy  =  o. 

Suppose  Ik  =  t  and  A  >  l.  Then  q'  €  <f  ^  so  by  the  strictness  assumption 
(174,175)  it  follows 

^ad£-1  (— /)$•  Af  (yk)  =  LtiLj(yk)  =  0 


so  rjf  =  o. 

Lemma  5.4  Suppose  the  nonlinear  system  has  strict  observability  indices  A ,  •  •  •  > 

ip  and  t  —  min{fi . ip}.  Define  vector  fields  q  by  (159)  and  symbols  T)?  by 

(160).  Let  g  =  g(y)  be  a  change  of  coordinates  among  those  outputs  of  lowest 
observability  index,  i.e. 

=  6’  if  A  or  L  >  L  (180) 

dy, 

Define  q  by  (1ST)  and  rjf  by  (161)  then  and  are  related  as  Christoffel 
symbols  (162). 

The  proof  of  this  b  similar  to  that  of  Lemma  5.2,  see  [15], 

Lemma  5.5  Suppose  the  nonlinear  system  (72,73)  has  strict  observability  in¬ 
dices  i\,...,ip  and  i  =  min{£i,...,fp}-  U  (72,73)  admits  an  observer  form 
(132,133)  then  the  r^  (£)  defined  by  (160)  are  functions  only  of  y  and  define  a 
flat  connection  on  the  output  space. 

Proof  By  Lemma  5.6  we  know  that  =  0  if  A  >  A  or  lk  >  l.  So  all  we  need 
to  show  b  the  existence  of  an  observer  form  for  (72,73)  implies  the  exbtence  of 
a  change  of  coordinates  among  those  outputs  of  lowest  observability  index  (180) 
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which  to  transform  the  17  to  sero  for  U  =  £3  =  Ik  —  t.  But  it  is  clear  from  the 
proof  of  Lemma  5.3  that  if  we  were  to  compute  the  17  defined  by  (160)  for  a 
system  in  observer  form  then  they  are  sero. 

By  Lemma  5.7  the  17  for  ti  =  lj  =  Ik  —  t  transform  like  Christoffel  symbols 
under  a  change  of  coordinates  among  those  outputs  of  lowest  observability  index. 
By  (180)  the  change  of  output  coordinates  to  observer  form  y  —  q-1(y)  transform 
the  outputs  of  lowest  observability  index  among  themselves  and  can  be  used  to 
take  17  to  sero  for  t+  =  /,•  =  Ik  =  t~ 

If  I7(£)  defined  by  (160)  are  the  Christoffel  symbols  of  a  flat  connection  on 
the  output  space  then  we  can  solve  the  partial  differential  equations  (155,156) 
to  find  flat  coordinates  y.  These  coordinates  are  not  necessarily  the  y  of  the 
observer  form  if  it  exists.  But  at  least  those  of  lowest  observability  index  are 
because  of  (171).  We  change  notation  and  denote  the  flat  coordinates  by  y.  ' 

The  next  stage  is  to  find  the  next  smallest  distinct  observability  index  P  = 
min{4  >  £}.  We  define  new  symbols 

1?  =  ^(^Mad^-^-W.ad^-^-/)^]))  (181) 


where  Pi  =  P  A  P. 

It  is  not  hard  to  see  by  an  ai  0uinent  similar  to  Lemma  5.7  that  rjf  =  0  if  £ 
or  lj  >  P.  Moreove  if  ti  =  l}  =  Ik  =  t  then  the  17  of  (181)  are  just  l/P  times 
the  17  of  (160).  The  latter  are  sero  by  our  choice  of  flat  output  coordinates. 

For  reasons  explain..1  below,  if  the  system  admits  an  observer  form  then  17 
defined  by  (181)  define  a  flat  connection  on  the  output  space.  If  this  is  so  then  we 
solve  (155,156)  for  new  flat  output  coordinates  9.  Because  of  the  above  remarks 
the  change  of  coordinates  will  satisfy 

=  61  if  4  =  P  or  1,  >  P. 
dy, 


We  continue  on  in  this  fashion  until  we  have  exhausted  the  list  of  observability 
indices  or  found  symbols  which  do  not  define  a  flat  connection.  If  the  latter  does 
not  happen  then  the  last  flat  coordinates  y  are  the  desired  output  coordinates 
of  the  observer  form.  The  observer  form  will  exist  if  (139)  is  satisfied  for  r  =- 
1  i  =  l . <,■  and  (140)  holds  for  r  =  2 . j  =  l . m. 

To  see  why  this  approach  is  valid  consider  a  system  (72),  (73)  which  can  be 
transformed  into  observer  form.  Using  Lemmas  5.7  and  5.8  we  can  assume  that 
i li  =  y*  for  those  outputs  of  lowest  observability  index  =  l.  Assuming  that 
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u  =  0  we  have 


Vi  =  Sn  yi  =  *<i 

ft  l  =  ft'2  Xil  =  Zi2  —  Qt|< 


£«<=/•(£)  Xil  =  -<Xil 
By  comparing  these  we  arrive  at 


(ir  =  Xir-'E(£tr—i*u 


and 


(182) 


(183) 


(184) 


We  add  dummy  state  variables  £,r,  x,>  for  r  =  /+ 1, . . .  ,£  to  (182)  as  follows 


y»  f»i  y»  —  xn 

Sil  =  ft'2  Xil  =  Xi2  —  01  it 

Sit  —  Si  *+i  +  /i(0  xit  =  xil+ 1  -  ait  (185) 

Si  t+1  =  Si  1+2  Xi  t+ 1  =  Xi  i+ 2 

Sit1  =  0  =  0 

It  is  not  hard  to  see  using  (184)  that  these  are  transforms  of  each  other  under 
(183)  and 

Sir  =  Xir  l<  V  <£ .  (186) 

Hence  if  the  original  system  (182)  can  be  transformed  to  observer  form  and 
y,  =  j 'a  then  so  can  the  modified  system  (185).  Moreover  for  the  modified 
system  the  smallest  observability  index  is  now  1‘  rather  that  L  so  we  can  apply 
Lemmas  5.7  and  5.8.  It  is  a  straightforward  calculation  to  show  that  the  symbols 
of  the  modified  system  defined  by  (160)  with  l  replaced  by  l'  are  the  same  as 
those  given  by  (181). 
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Abstract:  One  traditional  approach  in  the  analysis  and  design  of 
nonlinear  control  systems  is  a  first  order  approximation  by  a  linear 
system.  A  new  approach  is  to  use  nonlinear  change  of  coordinates 
and  feedback  to  construct  linear  approximations  that  are  accurate  to 
second  and  higher  orders.  However,  the  algebraic  calculations 
required  to  obtain  these  aproximations  are  somewhat  lengthy.  In 
this  paper,  the  theoretical  framework  for  finding  such  change  of 
coordinates  for  a  nonlinear  system  are  described.  A  software 
package  that  symbolically  solves  these  transformations  is  currendy 
being  prepared. 


1.  Introduction 

There  is  no  general  method  for  dealing  with  all  nonlinear 
systems  because  nonlinear  differentia!  equations  are  virtually  devoid 
of  a  general  method  of  attack.  A  well-known  and  straightforward 
way  to  analyze  nonlinear  control  systems  is  to  obtain  a  linear 
approximation  of  the  plant  dynamics  around  a  nominal  operating 
point  and  design  a  feedback  law  for  the  resulting  linear  system.  If 
the  nonlinearities  are  strong,  this  approximation  is  valid  for  only  a 
limited  range  of  the  operating  regime,  and  performance  degradation 
or  loss  of  stability  of  the  control  system  may  occur  as  the  system 
moves  away  from  the  nominal  point.  Then  it  may  be  necessary  to 
repeat  the  linearization  and  design  a  new  controller  for  the  updated 
linear  representation.  This  process  is  repeated  as  often  as  necessary, 
as  dictated  by  the  nonlinearities  in  the  plant. 

Another  approach  is  to  feed  some  nonlinear  correction  terms 
into  the  linearized  plant  model  to  compensate  for  the  inaccuracies 
involved  in  the  approximations.  However,  it  is  usually  not 
straightforward  to  find  such  correction  terms.  Poincare's  theory  of 
normal  forms  produces  a  fruitful  technique  for  transforming  a 
nonlinear  vector  field  to  a  simpler  form  in  the  neighborhood  of  an 
equilibrium  point.  Another  method  employed  in  robotics  is  the 
cancellation  of  all  the  nonlinear  terms  by  feedback.  Alternatively, 
with  the  method  of  linearizing  transformations  one  seeks  a  change  of 
coordinates  and  state  feedback  to  transform  the  nonlinear  system 
into  a  linear  one.  Various  forms  of  this  question  have  been 


addressed  by  Brockett  [8],  Hunt  and  Su  [3],  Jakubczyk  and 
Respondek[4],  Sommer  [9],  and  Krener  [1,5].  The  concepts  on 
which  the  present  paper  is  based,  and  the  necessary  and  sufficient 
conditions  for  the  existence  of  a  solution,  have  been  treated  by 
Krener  in  [2], 

The  method  proposed  is  to  find  a  nonlinear  change  of 
coordinates  for  a  nonlinear  system  to  construct  a  linear 
approximation  of  the  plant  dynamics  accurate  to  second  or  higher 
order.  Based  on  these  more  accurate  approximations  one  should  be 
able  to  design  controllers  that  give  improved  performance  over  a 
wider  range  of  operating  conditions.  The  computations  required  to 
calculate  these  transformations  are  somewhat  complicated.  As 
suggested  in  [2],  this  difficulty  may  be  overcome  with  the  aid  of  a 
symbolic  algebraic  computation  package.  The  goal  of  this  paper  is 
to  describe  the  theoretical  framework  for  finding  the  required 
transformations. 

2.  Linearizing  Transformations 

Let  us  consider  a  nonlinear  system  in  which  the  control  u 
enters  the  dynamics  in  a  linear  fashion: 

x  =  f(x)  +  g(x)u  (la> 

x(0)  =  x*.  Ob) 

where  x  e  SR"  and  u  e  SRm.  The  system  is  assumed  to  be  at  rest  at 
the  nominal  operating  point  (x*;  u*  =  0).  For  brevity  of  the 
expressions  we  will  assume  x’  =  0.  The  calculations  can  be  easily 
extended  to  the  case  x"  *  0.  First,  consider  the  linearization  of  (1) 
at  x": 

x  =  Ax  +  Bu  (2a) 

A  =  — (0),  B  =  g(0).  (2b) 

Dx 
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We  will  seek  a  coordinate  change  for  (1 )  of  the  form  identity  plus 


So  (9)  can  be  wrinen  as: 


higher  order  terms,  such  that  the  resulting  linear  plant  will  agree 
with  ( l)  up  to  an  error  of  order  0(x,u)P  '  (i.e.  terms  of  (Xx^'and 
0(x)Pu)  where  p  is  the  degree  of  approximation.  Obviously,  Eqn. 
(2)  results  when  p  =  1.  In  the  following,  the  case  for  p  =  2  will  be 
derived  and  the  results  will  be  generalized  to  any  arbitrary  order  p  by 
induction. 

We  assume  a  transformation  of  the  form: 

(2) 

x  =  z  +  d  (z)  (3) 

where  z  denotes  the  new  set  of  coordinates.  <j>®  is  a  polynomial  of 
degree  2,  the  monomial  coefficients  of  which  are  to  be  found. 

The  time  derivative  of  (3)  yields: 

.  .  aAz). 

x  =  z  +  — z  (4) 


z  =  Az  +  Bu  +  f*\z)  -  {Az,4>(2)(z)]  +  g^(z)u  -  |Bu,0^(z)! 

+  0(z,u)3  (10) 

(2) 

With  the  following  choice  for  <t>  all  the  second  order  terms  of  (10) 
will  vanish  and  the  approximation  will  be  accurate  to  second  order 


f(2)(z)  = 


[Az4.(2)(z)] 


g(l)(z)u  =  (Bu,^2\z)) 


which  must  hold  for  all  constant  u.  Eqn.  (11a)  is  called  the 
Homological  Equation  [6).  A  solution  to  (1 1)  has  to  be  found  by 
using  the  freedom  in  the  choice  of  u;  we  use  a  feedback  of  the 
following  form  [2]: 

u  =  a(2)(x)  +  (I  +  P(lW  (12) 


We  solve  for  the  differential  equation  in  the  new  coordinates  z: 

i-oA-i 

oz 


To  evaluate  (5),  the  functions  f(x)  and  g(x)  are  expanded  in  a  Taylor 
series,  and  (3)  is  introduced: 

f(x)  =  f^x)  +  f®(x)  +  O(x)3 

=  f(1)(z  +  <l.(2)(z))  +  f(2)(z)  +  0(z)3 

=  Az  +  A<{i(2\z)  +  f(2)(z)  +  O(z)3  (6) 

g(x)  =  g(0)(x)  +  g(l)(x)  +  O(x)2 

=  B  +  g(1\z)  +  O(z)2  (7) 


where  cl  ;(x)  is  an  mxl  vector  of  order  2  polynomials,  I  is  an  mxm 
identity  matrix,  and  P^\x)  is  an  mxm  matrix  of  first  order  terms.  A 
new  input  in  the  linearized  coordinates  is  designated  as  v.  Note  that 
v  agrees  with  u  to  first  order.  With  the  introduction  of  this 
feedback,  f  and  g  of  Eqn.  (1)  arc  redefined: 

f(x)  =  f(x)  +  g(x)a(2)(x)  (13a) 

g(x)  =  g(x)  +  g(x)P(l)(x)  (13b) 

The  Taylor  expansion  of  (1 3)  yields 

f(x)  =  Ax  +  Ba^(x)  +  f(2)(x)  +  O  (x)  (14a) 

g(x)  =  B  +  g(I)(x)  +  Bp0)(x)  +  02(x)  (14b) 


W  -1 

The  term  ( I  + - )  in  (5)  is  expanded  in  a  series  around  z  =  0 

dz 


-,.(2)  (2)  (2) 

dz  dz  dz 

Then,  combining  (6),  (7),  and  (8)  in  (5)  and  expanding  we  get: 

(2)  a)  j<*><2)  (i)  a<t>(2) 

z  =  Az  +  Bu  +  (z)  +  f  (z) - Az  +  g  (z)u  — ~ —  Bu 

dz  dz 

+  0(z,u)3  (9) 

Now  we  intrixlucc  some  notation.  The  Lie  bracket  of  two  vector 
fields  is  another  vector  field  defined  by: 

tf.gi  A-f  g 

dx  dx 


7  (2)(x)  =  Ba(2)(x)  +  f(2)(x) 
g(1)(x)  =  g0)(x)  +  Bp0)(x) 


Reiterating  the  steps  of  Eqns.  (3)  through  (1 1)  we  find: 

f(2)(z)  =  (Az,t>(2)(z)]  (16a) 

g0)(z)  =  [Bu,4>(2)(z)l  (16b) 

The  distinction  between  Eqns,  (11)  and  (16)  is  seen  when  (16)  is 
rewritten  as: 


f(2)(z)  =  -  Ba(2)(z)  +  |Az,i>(2)(z)1 
g(l)(z)v  =  -  Bp(1)(z)v  +  [Bv,0(2\z)1 


In  the  generalized  homological  equation  of  (17),  the  second  order 
terms  f^2\z)  and  g^\z)v  can  be  cancelled  out  under  certain 


solvability  conditions  by  proper  choice  of  <j/2\z),  a^(z),  and 

P*'\z).  For  this  second  order  linearization  we  have  a  system  of 

2  2  .2 
n  (n  +  l)/2  +  n  m  linear  algebraic  equations  in  n  (n  +  l)/2  +  mn(n 

2 

+  l)/2  +  m  n  unknowns.  When  a  solution  can  be  found,  the 
resulting  system  becomes: 

z  =  Az  +  Bv  +  0(z,v)3  (18) 

In  order  to  find  an  approximation  of  the  next  higher  order,  we 
rewrite  (18)  by  reverting  to  the  variables  x  and  u: 

x  =  Ax  +  Bu  +  0(x,u)3  (18)' 

Now  we  are  asuming  that  in  the  given  nonlinear  system  the  second 
order  terms  have  been  already  been  canceled  as  outlined  above. 
Then  we  seek  a  new  transformation  of  the  form: 

x  =  z  +  <t>(3\z)  (19) 

Note  that  transformation  (19)  will  not  introduce  any  terms  of  degree 
less  than  3.  Then  the  same  procedure  outlined  above  is  repeated, 


with  the  feedback: 

U=a<3)(x)  +  (I  +  P(2)(x))v  (20) 

which  results  in: 

(®(z)  =  -  Ba(3)(z)  +  [Az,<j>(3)(z)]  (21a) 

gm(z)v  =  -  Bpa)(z)v  +  [Bv,4>{3)(z)]  (21b) 

These  results  can  be  generalized  as  follows.  Given  a  system  which 
is  accurate  to  only  order  p-1,  i.  e. 

x  =  Ax  +  Bu  +  0(x,u)P  (22) 

a  coordinate  change  is  sought  as: 

x  *  z  +  ^(z)  (23) 

along  with  feedback: 

u  =  a<p)(x)  +  (I  +  p(,Hl)(x))v  (24) 

which  yields  the  homological  equation  to  be  solved: 
f*P)(z)  =  -  Boc(p)(z)  +  [ Az,«J><P)(z))  (25a) 

g^Wv  =  -  B0(J>~!)(z)v  +  (Bv.O^Vz)!  (25b) 


I  A(p)  f(p)  „(p)  (P_I)  J  nip-1)  , 

In  (25),  <5  .  f  ,  a  ,  g  and  p  are,  respectively, 

homogeneous  vector  fields  of  orders  corresponding  to  their 
superscripts.  The  resulting  system  is  accurate  up  to  order  p: 

z  =  Az  +  Bv  +  0(z,v)K  (26) 


3.  Linearizing  Transformations  for  Systems  with  Small 
Parameters 

In  this  section,  we  consider  a  control  system  of  the  form: 

x  =  f(x,e)  +  g(x,e)u  (27a) 

x(0)  =  x\  (27b) 

where  £  is  a  small  parameter  that  characterizes  the  way  parasitic 
effects  or  disturbances  enter  into  the  system.  We  will  develop  a 
method  of  linearizing  transformation  for  this  type  of  system,  similar 
to  that  of  Section  2.  First,  (27)  is  expanded  as  follows: 

x  =  Ax  +  Bu  +  e(fJ'1)(x)  +  g(1)(x)u)  +  0(e)2  (28) 

In  (28),  the  nonlinear  function  is  expanded  and  grouped  in  powers 
of  e.  Thus,  the  superscripts  of  f  and  g  now  correspond  to  the 
powers  of  £  these  functions  multiply,  in  contrast  with  the  notation  of 
Section  2.  A  coordinate  change  is  assumed  of  the  following  form: 

x  =  z  +  e<t/'\z)  (29) 

where  the  form  and  the  polynomial  order  of  the  function  <p^\z)  is 
not  determined  yet  Repeating  the  calculations  similar  to  the  steps  of 
Eqns.  (4)  through  (9)  of  Section  2  yields: 

z  =  Az  +  Bu  +  £(/1)(z)  -  [Az,4>(I)(z)]  +  g0)(z)u  -  [Bu,0O)(z)]) 


+  0(e)2  (30) 

An  input  for  the  control  system  of  Eqn.  (28)  is  chosen  as: 

u  =  v  +  e(a(1)(x)  +  P0)(x)v)  (31) 

After  a  sequence  of  calculations  similar  to  Eqns.  (13)  through  (17), 
the  homological  equations  are  found: 

f^z)  =  -  Ba0)(z)  +  [Az,<50)(z)]  (32a) 

g(1)(z)v  =  -  Bp0)(z)v  +  [Bv,<j»(1)(z>]  (32b) 

This  result  can  be  generalized  for  an  arbitrary  power  of  e  in  the  same 
fashion:  A  solution  to 

f^z)  =  -  Bot^fz)  +  [Az,(J)(P)(z)]  (33a) 

g(P)(z)v  =  -  Bp(p)(z)v  +  (Bv.^U)]  (33b) 

will  yield 

z  =  Az  +  Bv  +  0(e)P*'  '  (34) 


Even  though  Eqns.  (33)  and  (25)  look  very  similar,  there  are  some 
fundamental  differences.  All  the  variables  in  Eqn.  (33)  have 
different  definitions  than  those  of  Eqn.  (25),  as  mentioned  at  the 


beginning  of  this  section.  Moreover,  the  solvability  conditions  of 
(33)  are  not  the  same  as  the  conditions  of  Eqn.  (25).  Actually,  both 
(32)  and  (33)  may  represent  an  infinite  family  of  equations  as 
opposed  to  the  finite  dimensional  set  of  expressions  that  arise  from 
(25). 

Any  nonlinear  system  expressed  in  the  form  of  in  Eqn.  (1) 
can  always  be  transformed  into  the  form  of  (27)  as  follows:  First, 
consider  the  expansion  of  ( 1 )  as 

x  =  Ax  +  f(2)(x)  +  Bu  +  g^'\x)u  +  O(x.u)3  (35) 

Scale  the  coordinates  and  the  input  with: 

,  -I 

4  =  e  x 

-1 

U  =  e  ii 

introducing  the  above  into  (35)  yields 

k  =  A^  +  Bp  +  e(  f®(Q  +  gl)($)p )  +  O(e)2  (36) 

This  equation  is  of  the  form  of  Eqn.  (28),  except  for  the  difference 
in  the  way  expansions  of  f  and  g  are  defined.  We  use  the  overbar 
notation  to  emphasize  this  point.  The  input 

u  =  a(2)(x)  +  0  +  P(°(x))v  (12) 

is  also  transformed  with  an  additional  scaling  q  =  e  v: 

p  =  B  +  e(a(2)(f.)  +  P(U(^)ti)  (3?) 

With  this  scaling  of  coordinates,  a  linearization  problem  given  as  in 
Section  2  can  be  alternatively  solved  with  the  procedure  outlined  in 
this  section. 


4.  Form  of  the  Nonlinear  Compensation 

After  a  higher  order  linearization  is  obtained,  the  next  step  is 
to  choose  a  feedback  law  to  achieve  closed-loop  pole  assignment. 
Consider  the  approximation  of  Section  2  where 


x  =  Ax  +  Bu  +  0(x,u)P  (22) 

has  been  transformed  by  the  coordinate  change 
x  =  z  +  <t>^(z)  (23) 

and  feedback 

u  =  c/PVx)  +  (I  +  p<p-l,(x))v  (24) 

into 

z  =  Az  +  Bv  +  Ofz.v)**1  (26) 


A  closed-loop  pole  assignment  can  be  made  with  state  feedback  of 
the  form 

v  =  Fz  +  r  (38) 

where  r  is  an  open  loop  control.  The  approximation  of  (26)  then 
becomes 

z  =  (A  +  BF)z  +  Br.  (39) 

Notice  that  (23)  agrees  with  the  identity  transformation  up  to  order 
p- 1 ,  so  it  is  easily  inverted  at  least  up  to  order  p: 

z=x-4>(P)(x)  (40) 

The  input  u  of  Eqn.  (24)  in  the  original  coordinate  becomes,  with 
the  aid  of  (22),  (38),  and  (40): 

u  =  a(p)(x)  +  (I  +  P(P_1)(x))(F(x  -  4>(P)(x))  +  r) 

=  Fx  +  r  +  {a(p>(x)  +  p(^‘)(x)(Fx  -  F4,(p)(x)+  r ) 

-F^Vx)}.  (41) 

Thus  the  control  function  has  the  form  of  a  pole  assignment  for  the 
linear  part  of  (22)  plus  some  correction  terms  of  higher  order 
(grouped  in  the  bracket  of  Eqn.  (41)).  This  result  clearly  shows  the 
purpose  and  nature  of  the  nonlinear  feedback. 


6.  Condusion 

In  this  paper  we  have  presented  an  alternative  approach  to 
the  analysis  and  design  of  nonlinear  control  systems.  The  procedure 
consists  of  finding  a  coordinate  change  by  an  appropriate  feedback 
to  achieve  higher  order  linear  approximations  to  nonlinear  systems. 
Because  of  space  limitations,  we  have  not  presented  the  details  of 
the  solvability  conditions.  The  method  of  solving  for  the  linearizing 
transformations  is  based  on  the  normal  forms  approach  of  Poincar6, 
which  is  a  widely  used  technique  in  the  analysis  of  bifurcations  in 
nonlinear  vector  fields.  This  suggests  the  applicability  of  these 
powerful  bifurcation  methods  in  nonlinear  control  systems  analysis. 
Aeyels  [10]  and  Abed  and  Fu  [11]  have  studied  the  local 
stabilization  problem  for  nonlinear  systems  with  this  approach.  In 
other  words,  the  method  is  an  appropriate  tool  for  the  analysis  of 
nonlinear  systems  in  which  plant  parameter  variations  cause 
fundamental  changes  in  the  structure  of  the  system.  Another 
important  issue  is  the  following:  When  a  solution  exists,  the 
functions  a^(x),  P^'^x),  ^\x)  are  not  necessarily  unique.  The 
question  of  what  is  the  best  choice,  or  even  what  is  a  reasonable 
choice  among  the  possible  solutions  needs  more  investigation. 

The  equations  that  need  to  be  solved  for  finding  the 
transformations  are  a  set  of  linear  algebraic  equations.  However, 


the  number  of  equations  grow  rapidly  with  increasing  orders  of 
linearization  and  with  higher  dimensional  systems.For  example,  for 
a  second  order  linearization  and  with  n  states  and  m  inputs  we  have 

"fi?  *  2 

a  system  of’n  (n  +  l)/2  +  n  m  linear  algebraic  equations  in 
n  (n  +  l)/2  +  mn(n  +<l)/2  +  m  n  unknowns.  With  the  use  of 
symbolic  algebraic  manipulation  packages  and  with  the  availibility  of 
increasingly  powerful  computers,  this  is  not  considered  as  a  serious 
setback.  A  symbolic  algebra  program  that  automatically  solves  these 
transformations  on  the  computer  is  in  preparation. 


References 


[1]  Krener,  A.  J„  "On  the  Equivalence  of  Control  Systems  and 
the  Linearization  of  Nonlinear  Systems",  SIAM  J.  Control  11 
(1973)  670-676. 

[2]  Krener,  A.  J.,  "Approximate  Linearization  by  State  Feedback 
and  Coordinate  Change",  Systems  and  Control  Letters  S  (1984) 
181-185. 

[3]  Hunt,  L.  R.  and  R.  Su,  "Linear  Equivalents  of  Nonlinear 
Time  Varying  Systems",  Proceedings  MTNS  Symposium,  Santa 
Monica  (1981)  119-123. 

[4]  Jakubczyk,  B.  and  W.  Respondek,  "On  the  Linearization  of 
Contr  '  Systems",  Bull.  Acad.  Polon.  Sci.  Ser.  Math.  Astron. 
Physics  28  (1980)  517-522. 

[5]  Krener,  A.  J.,  "New  Approaches  to  the  Design  of  Nonlinear 
Compensators",  Proceedings  of  the  Berkeley  Ames  Conf.  on 
Nonlinear  Problems,  Aerodynamics  and  Flight  Control  (Math.  Sci. 
Press,  1984). 

[6]  Arnold,  V.  I.,  Geometric  Methods  in  the  Theory  of  Ordinary 
Differential  Equations  (Springer  Verlag,  New  York,  1983). 

[7]  Guckenheimer,  J.  and  P.  Holmes,  Nonlinear  Oscillations, 
Dynamical  Systems,  and  Bifurcations  of  Vector  Fields  (Springer 
Verlag,  New  York,  1983). 

[8]  Brockett,  R.  W.,  "Feedback  Invariants  for  Nonlinear 
Systems"  Proceedings,  IF  AC  Congress,  Helsinki  (1978). 

[9]  Sommer,  R.,  "Control  Design  for  Multivariable  Non-linear 
Time-varying  Systems",  Int  J  Control  31  (1980)  883—891. 

[ 10]  Aeyels,  D.,  "Stabilization  of  a  Class  of  Nonlinear  Systems  by 
a  Smooth  Feedback  Control",  Systems  and  Control  Letters  5  (1985) 
289-294. 

[11]  Abed,  E.  H..  and  J.  H.  Fu,  "Local  Feedback  Stabilization  and 
Bifurcation  Control,  I.  Hopf  Bifurcation",  Systems  and  Control 
Utters  7  (1986)  11-17. 


9 


AKfcR.TR.  <j  9  ••  1  639 

Determining  Torque  and  Velocity  Limits  on  Joint  Actuators  for 
Robot  Manipulators  with  Coupled  Joint  Motion* 


Sinan  Karahan 

Department  of  Mechanical  Engineering 
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Abstract:  In  robots  with  remotely  driven  links,  the 
relationships  between  some  ot  the  actuator  and  Joint 
motions  are  coupled.  In  this  paper  robot  dynamic  equations 
for  these  types  of  manipulators  are  analyzed.  The 
equations  are  expressed  in  actuator  coordinates  by  means 
of  the  coupling  matrix  between  joint  and  actuator 
coordinates.  The  motivation  is  that.  In  the  actuator 
coordinates,  the  actuator  velocity  and  torque  Imits  can  be 
calculated.  The  procedure  is  demonstrated  by  an  example. 

Keywords:  Robot  manipulator,  torque  limit, 
coupled  motion,  robot  dynamics. 


1.  Introduction 

In  the  formulation  of  manipulator  dynamic  equations, 
two  well-known  methods.  Newton-Euler  [4]  and  Lagrangian 
[1  .SI  have  been  the  most  popular.  While  the  Newton-Euler 
method  is  computationally  more  efficient,  it  Is  the 
Lagrangian  method  that  allows  better  insight  into  the  the 
dynamics,  as  well  as  into  the  analysis  and  design  of  control 
systems  for  manipulators.  In  the  Lagrangian  formulation, 
the  Denavit-Hartenberg  convention  [2]  Is  often  used  to 
assign  link  parameters.  Within  this  framework,  the  Joint 
coordinates  are  measured  with  respect  to  the  next  lower 
link  in  the  open  kinematic  chain,  regardless  of  the 
mechanical  characteristics  of  the  robot.  While  for  the  more 
recent  direct  drive  robot  designs  there  is  a  one-to-one 
relationship  between  Joint  and  actuator  coordinates  except 
tor  the  gear  ratio  conversions  ( thus  the  name  ’direct  drive"), 
conventional  type  manipulators  generally  have  remotely 
driven  links  due  to  limitations  on  cost,  weight,  and  other 
design  criteria  In  such  robots,  the  relationships  between 
some  of  the  actuator  and  Joint  motions  are  coupled,  with  an 
actuator  rotation  or  dtoplacement  resulting  in  the  motion  of 
more  than  one  link.  Avoidance  of  the  upper  bounds  on 
actuator  torques  or  forces  is  a  concern  for  high  speed 
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Research  under  grant  number  85-0267. 


motions  and  optimal  control  schemes,  because  the  end 
effector  of  the  robot  will  not  be  able  to  follow  the 
commanded  trajectory  if  the  actuator  limits  are  exceeded. 
Formulating  the  equations  tor  remotely  driven  manipulators 
in  Joint  coordinates  will  result  in  complicated  calculations  for 
the  upper  bounds  tor  some  of  the  Joints,  thus  making  it 
difficult  to  check  the  torque/force  imits  at  the  actuator  level 
with  given  motion  commands  at  the  Joint  level. 

The  goal  of  this  paper  is  to  reformulate  the 
Lagrangian  dynamic  equations  for  robots  In  actuator 
coordinates,  and  to  show  that  the  problem  of  checking  for 
the  torque/force  upper  bounds  can  be  simplified. 

2.  Manipulator  Dynamic  Equatlona  and  their 
tranaformatlona 


The  equations  of  motion  for  an  n-Mnk  open  chain 
manipulator  arm  are  given  in  vector-matrix  form  as: 


D(8)8  + 


e^e 

-e'c„e- 


♦  Q(0)  -  T 


(D 


where  8, 8,  8  are,  respectively,  (nxl)  Joint  displacement, 
velocity,  and  acceleration  vectors.  D(8)  represents  the 
(nxn)  configuration  dependent  inertia  matrix.  Ct.  C2, . .  ,Cn 

are  the  (nxn)  Coriois-  centrifugal  force  coefficient  matrices, 
G(6)  is  the  (nxl)  vector  of  gravitational  forces  acting  on 
each  Joint,  and  t  is  the  vector  of  forces  applied  by  the 
actuators.  We  note  her*  that,  for  a  manipulator  with  Joint 
motion  coupling,  x  Is  composed  of  the  forces  appled  to  the 
Joints,  not  the  actuator  output  forces.  For  the  purposes  of 
the  derivations  to  be  performed  later,  It  is  preferable  to 
rewrite  (1 )  In  the  following  form: 
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In  (2),  the  symbol  ®  represents  the  Kronecker  product  of 
matrices,  and  1n  Is  the  {nxn)  identity  matrix.  Obviously,  the 

matrix  [  l„  ®  6*1  has  dimensions  (nxn2)  and  the  Coriolis- 
centrifugal  coefficient  matrix  of  C,'s  Is  (n^n).  We  refer  the 
reader  to  [3]  as  an  excellent  reference  on  Kronecker 
algebra  of  matrices. 


Let  the  coupling  between  joint  and  the  actuator 
displacements  be  given  by: 

6-H«p  (3) 

where  is  the  (nxl)  vector  of  actuator  displacements,  and  H 
is  the  (nxn)  coupling  matrix.  H  is  constant  and  non-singular. 
For  a  direct-drive  robot  arm,  H  would  reduce  to  a  diagonal 
matrix  with  the  corresponding  gear  reduction  ratios  as  the 
entries.  By  differentiating  (3)  with  resocct  to  time,  we  obtain: 


(4) 


and 

8  -  H«p  (5) 

On  the  other  hand,  the  relationship  between  the  actuator 
forces  and  the  joint  forces  is: 

x.-H't  (6) 

where  t,  is  the  force  Input  vector  in  actuator  coordinates. 


Introducing  the  relations  (4)  and  (5)  into  the 
equations  of  motion  (2)  yields: 
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Multiplying  each  side  of  Eqn.  (7)  by  H*  we  get: 
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In  the  above,  we  first  note  that  the  right  hand  side  Is  the 
same  as  Eqn.  (6),  i.e.  xr  By  using  the  identity  (A®B)(E®F)  - 


(AE®BF)  (this  Is  possible  whenever  A,  E  and  B.  F  are 
conformable  pairs;  see  (31).  we  rewrfte  the  term  involving 
the  Kronecker  product  as: 


H*  1 1#  ®  ^H*  1  *  1 1„  ®  s?  ]  (H*  ®  H1  ]  (9) 

The  term  [H*  ®  H*j  can  also  be  written  as  (H*)'2',  the 
superscript  in  brackets  being  the  notation  for  Kronecker 
square  of  matrices  [3].  With  this  result,  Eqn.  (6)  becomes: 
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where  the  parameters  corresponding  to  the  actuator 
coordinates  have  been  assigned  the  subscript  *a*. 
Comparing  with  Eqns.  (8)  and  (9),  these  are: 
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Gravitational  forces 


and  the  torque  term  Is  given  by  Eqn.  (6). 

3.  Application  to  velocity  and  torque  limit 
calculations 


For  a  practical  implementation,  consider  the  problem 
of  calculating  torque  commands  to  the  actuators  for  a  given 
trajectory  to  be  followed.  Let  the  maximum  force/torque  he 
actuators  can  deliver  be  x4m„.  Additionally,  there  is  usualy 

a  maximum  velocity  at  which  the  actuators  can  be  driven, 
which  we  denote  by  em„.  For  a  typical  DC  motor,  the 
maximum  torque  is  Rmited  by  the  maximum  current  rating  of 
the  motor,  while  the  maximum  velocity  is  proportional  to  the 
voltage  applied.  The  actuator  dynamic  effects  are  ignored 
in  this  context.  We  assume  that  the  sensors  placed  at  the 
actuator  or  joint  level  measure  the  displacement  and  the 
velocity  and  that  the  acceleration  can  be  approximated  by: 


(pi  -  (p 

-  At 


where  superscripts  represent  the  tth  and  i-lst 
measurements,  and  At  is  the  sampling  period.  We  need  to 
calculate  the  left-hand  side  of  Eqn.  (10)  with  these  velocities 
and  accelerations.  This  will  yield  the  total  forces/torques 
required  at  each  joint  to  achieve  the  motion  Commanded. 
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Call  this  quantity  A.  Comparing  tha  vector  A  obtained  from 
this  calculation  with  the  actuator  limits  term  by  term, 
which  are  the  maximum  forces/torques  that  are  available,  It 
is  possible  to  determine  whether  any  of  these  limits  are 
exceeded.  Whenever  a  torque  or  velocity  limit  is  exceeded, 
one  needs  to  renormalize  the  actuator  commands  in  the 
following  manner: 


mechanical  connection.  For  this  robot,  Eqn.  (3)  becomes: 
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and  if  N  >  1 ,  in  which  case  either  a  torque  or  velocity  limit  is 
exceeded,  then  set 

ii  .  , 

¥ww  =  FT  U1-n 


In  the  above,  k,.  kj.  kj,  are  the  gear  transmission  ratios 
between  the  actuators  and  the  Joints.  By  taking  the  time 
derivative  of  (11)  the  counterparts  of  Eqns.  (4)  and  (5)  are 
also  obtained.  For  the  purposes  of  the  foregoing 
calculations  the  explicit  forms  of  the  inertia  matrix  elements 
and  the  other  terms  in  Eqn.  (2)  will  not  be  necessary. 
Therefore,  the  Inertia  matrix  Is  expressed  in  its  symbolic 
form: 


This  will  renormalize  all  the  joint  velocity  commands  with 
the  same  ratio,  thus  ensuring  that  the  manipulator  end 
effector  stays  on  the  commanded  path,  even  though  at  a 
slower  velocity.  Using  such  an  algorithm,  if  a  limit  is 
reached,  usually  only  one  of  the  actuators  will  be  at  its 
maximum. 

4.  An  Example 

In  this  section,  the  calculations  of  the  joint-actuator 
coordinate  transformations  will  be  demonstrated  with  a 
simple  example.  Consider  the  3  d-o-f  revolute  manipulator 
of  Fig.  1: 


Calculation  of  the  Inertia  matrix  in  actuator  coordinates 
yields: 
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Similarly,  the  Coriolis-centrifugal  terms  are  calculated  as: 


The  coordinate  system  on  the  robot  and  link  parameters  are 
assigned  in  accordance  with  the  Denavit-Hartenberg 
notation.  It  is  assumed  that  joint  3  is  driven  remotely  from 
the  base,  and  that  the  motion  is  transmitted  through  a 


■c.r 

■c,- 

.  (H1 »  H'l 

c2 

H- 

LcJ 

Lc3J 

H^H 

H*(C2-C3)H 


*2 

fcj 

*3 


h*c3h 


Note  that  in  the  above  C,  are  (n  x  n)  matrices  of  Coriolis- 
centrifugal  force  coefficients.  The  gravitational  terms 
become: 
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And,  Anally,  the  torque  terms  are  transformed  as: 
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5.  Conclusion 

In  this  papsr,  manipulator  dynamics  aquations 
^formulated  in  joint  coordinates  were  transformed  into 
actuator  coordinates  by  use  of  the  actuator-joint  coordinate 
coupling  matrix.  This  resulted  in  the  expressions  for  the 
transformation  of  Inertia  and  Coriolis-centrifugal  terms,  as 
presented  in  Section  2-  Then,  the  final  form  of  the  equations 
allows  an  easier  implementation  of  an  algorithm  that  checks 
the  velocity  and  torque  Imits,  and  renormalizes  the  actuator 
commands  so  as  to  prevent  tracking  errors. 
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Application  of  An  Efficient  Nonlinear  Filter* 

Ruggero  FREZZA ,  Sinan  KARAHAN,  Arthur  J.  KRENER,  Mont  HUBBARD 

Abstract-  In  this  paper  we  present  an  application  of  a  new  filtering  technique  based  on  geometric  linearization 
and  asymptotic  analysis.  The  technique  is  compared  to  the  conventional  extended  Kalman  filter  to  demonstrate  its 
computational  efficiency. 

Introduction 

While  the  theory  of  linear  filtering  has  been  well  developed  and  understood,  practical 
nonlinear  filtering  has  typically  relied  on  heuristic  techniques.  One  of  these  is  the  extended  Kalman 
filter  which  is  based  on  a  linear  approximation  of  the  system  equations  around  a  trajectory.  The 
asymptotic  geometric  nonlinear  filtering  technique  was  developed  by  Krener  in  [5].  It  is  based  on 
the  so-called  observer  normal  form  which  linearizes  the  dynamics  by  a  change  of  state  coordinates 
and  output  injection.  The  development  of  the  observer  normal  form  is  due  to  Krener  and  Isidori 
[1],  Krener  and  Respondek  [2],  Bestle  and  Zeitz  [8],  Zeitz  [11,12],  Fritz  and  Keller  [9],  Keller 
[6,7],  and  Li  and  Tao  [10]. 

The  approach  is  geometric  and  consists  of  finding  a  change  of  coordinates,  in  general 
nonlinear,  such  that  the  equations  of  the  system  transform  to  their  most  linear  form.  Once  the 
system  is  in  "nearly  linear"  form  it  will  be  possible  to  apply  asymptotically  the  theory  of  linear 
filtering.  This  has  many  advantages.  In  fact  it  is  possible  to  formally  define  optimality  and 
asymptotic  stability.  Moreover,  the  gains  of  the  filter  may  be  computed  off-line  because  the  Riccati 
differential  equation  is  independent  of  the  states.  This  reduces  the  on-line  computational  burden  of 
the  filter. 

The  only  drawback  of  the  technique  is  that  the  change  of  coordinates  generally  requires  a 
very  heavy  algebraic  computational  effort.  One  solution  to  this  is  to  use  already  existing  software 
packages  for  symbolic  computations.  Naturally  the  technique  is  not  applicable  to  all  nonlinear 
systems;  otherwise  we  would  have  discovered  that  everything  in  nature  is  linear  in  some 
appropriate  coordinate  set.  The  class  of  "nearly  linearizable"  systems  is  substantial,  and.gives  to 
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generally  requires  a  very  heavy  algebraic  computational  effort.  One  solution 
to  this  is  to  use  already  existing  software  packages  for  symbolic 
computations.  Naturally  the  technique  is  not  applicable  to  all  nonlinear 
systems;  otherwise  we  would  have  discovered  that  everything  in  nature  is 
linear  in  some  appropriate  coordinate  set.  The  class  of  "nearly  linearizable" 
systems  is  substantial,  and  gives  to  the  method  a  certain  flavor  of  generality. 

In  this  paper  we  will  illustrate  the  technique  for  a  specific  example.  We 
will  estimate  the  height,  the  velocity  and  the  ballistic  coefficient  of  a  falling 
object  in  an  atmosphere  with  variable  density.  We  will  also  implement  the 
extended  Kalman  filter  for  the  same  problem  and  compare  the  two  filtering 
techniques  in  terms  of  performance  and  computation  time.  Throughout  the 
paper  we  will  refer  to  the  geometric  asymptotic  nonlinear  filter,  the  new 
technique,  as  the  GANF  and  to  the  extended  Kalman  filter  as  the  EKF. 

1-  Brief  description  of  the  EKF  and  the  GANF  techniques 

We  say  that  a  given  nonlinear  system  without  input: 

C  =  f(0, 
y  =  h(C) . 

where  ^  e  Rn  and  y  e  R*,  is  observable  if  it  can  be  transformed  into 
observable  normal  form.  This  corresponds,  in  some  sense,  to  the  property 
of  observability  for  a  linear  system.  Normal  forms  have  the  advantage  of 
making  transparent  the  effect  of  an  input  on  the  dynamics  of  the  system. 
There  are  four  such  normal  forms:  Observer,  observable,  controller  and 
controllable.  In  the  present  application,  and  in  the  case  of  nonlinear 
observers  in  general,  we  will  have  occasion  to  use  only  the  first  two: 


Observable  form: 

x  =  Ax  -  Ba(x) 

y  =  Cx  (2) 

Observer  form: 

x  =  Ax  -  a(Cx) 

y  =  t(Cx)  (3) 

In  some  particular  cases  (in  the  application  treated  in  this  paper,  for 
example)  we  will  require  a  modified  observer  form. 

Modified  observer  form: 

x  =  Ax  -  a(Cx,CAx,...,CAn  Ix) 
y  =  y(Cx)  (4) 

where  A,  B,  C  are  the  standard  matrices  of  the  Brunovsky  canonical  form. 

For  the  case  of  a  three  dimensional  system  with  a  single  output: 


'0  1  o' 

o' 

A  = 

0  0  1 

,  B  = 

0 

0  0  0 

1 

C  =  [  1  0  0  ]  (5) 


The  system  (1)  can  be  transformed  into  observable  form  if  y  and  its  first  n-1  time 
derivatives  are  local  coordinates  on  the  state  space. 

A  physical  system  will  never  satisfy  exactly  a  set  of  differential  equations  ' 


like  (1).  In  reality  each  of  the  states  is  affected  by  random  process  noise,  the 
parameters  are  not  known  exactly,  and  the  measurement  of  the  output  will  be 
affected  by  observation  noise  due  to  the  physical  and  technical  limitations  on 
the  measurement  procedure.  The  system  (1)  can  then  be  expressed  in  its 
stochastic  equivalent: 


dC  =  f(Odt  +  Bdw 
dy  =  h(Qdt  +  Ddv 

m = n(^,  p0) 

Where  w  and  v  are  standard  Wiener  processes.  The  covariances  of  the  driving 
and  measurement  noise  are,  respectively,  Q  =  BB^  and  R  =  D^.  For  our  work 
we  will  assume  the  measurement  noise  to  be  small,  i.e.  D  =  £  where  £  is  a  small 
parameter.  Then  the  problem  is  to  estimate  the  states  of  (6)  at  time  t  given  the 
measurement  of  y  at  time  less  than  or  equal  to  t.  It  is  clear  that  in  the  absence  of 
noise  and  if  (1)  is  observable  the  problem  is  easily  solved  because  the  states 
could  be  computed  exactly  as  nonlinear  function  of  derivatives  up  to  order  n  -  1 
of  the  output. 

We  compute  the  estimates  by  introducing  the  filter: 

dC  =  f(C)dt  +  g(C)(dy-h(Qdt) 
dy  =  h(C)dt 

The  conventional  extended  Kalman  filter  technique  computes  the  estimates  from: 

dC  =  f(C)dt  +  P(t)HT(t  t)R-1(dy  -  h(Qdt) 


(6) 


dy  =  h(C)dt 


5 


where 


and 


p  =  F(C ,  t)P  +  PFT(C ,  t)  +  Q(t)  -  PHT(C ,  t)R4H(C ,  t)P 

P«o>  =  po  (9) 


(10) 


(11) 


and  P,  Q  and  R  are,  respectively,  the  covariance  matrices  of  the  estimate 
error,  the  process  noise  and  the  observation  noise. 

The  GANF  computes  the  filter  in  observer  form  coordinates.  In  these 
coordinates  the  system  (6)  can  be  written  as: 


dx  =  (Ax  +  a(y))dt  +  Gdw 

dy  =  Cxdt  +  Ddv  ( 1 2) 

x(0)  =  N(xq,  P0) 

where,  if  J  is  the  Jacobian  of  the  change  of  coordinates,  G  =  J  B.  If  the 
measurement  noise  is  small,  and  a(y)  is  smooth  enough,  then  a(y)  is 
approximately  equal  to  a(Cx)  and  we  can  use  the  filter  equations 


dx  =  Axdt  +  a(Cx)dt  +  PCTR 4(dy  -  Cx  dt )  (13) 

with  P  being  obtained  from  the  solution  to  the  Riccati  differential  equation 


P  =  AP  +  PAT  +  Q-  PCTR_1CP 
P(0)  =  P0 


6 


(14) 


Where  Q  =  GG^.  The  dynamics  of  the  error  are  given  by: 

dx  =  (A  +  PCtR_1C)  xdt  +  (a(y)  -  a(Cx  ))dt  +  Gdw  +  PCTR_1Cedv  ( 1 5) 

The  Riccati  equation  (14)  is  state  independent  and  hence  can  be  integrated 
off-line.  The  dynamics  of  the  error  are  nearly  linear  up  to  output  injection, 
and  the  covariance  of  x  asymptotically  equals  P(t).  We  can  assume,  without 
loss  of  generality,  that  the  output  injection  term  can  be  expanded  in  a  Taylor 
series  starting  from  the  second  order  term: 

a  d2a(Cx)  ~  2 

<x(y)  -  a(Cx)  =  -  (Cx  +  edvf  +  0(3)  (16) 

dy2 

In  fact,  we  can  combine  any  linear  term  of  the  above  in  the  (A  +  PCTR^Qx 
term  of  (15).  Then,  if  the  second  derivative  of  a  is  small,  the  output 
injection  term  can  be  neglected.  Practically,  we  are  requiring  a  to  have  a 
small  "curvature"  in  some  sense. 

One  should  note  that  these  two  filters  are  not  being  compared  on  exactly 
the  same  grounds  because  the  noise  covariances  of  the  EKF  and  the  GANF 
are  state  independent  in  both  t,  and  x  coordinates.  However  the  comparison 
is  justified  if  the  Jacobian  of  the  change  of  coordinates  J  is  nearly  constant 
along  the  trajectory  since  Q  =  JQJ7  and  R  is  invariant  under  the  change  of 
coordinates.  The  main  advantage  of  the  GANF  over  the  EKF  is  that  the 


T 
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Riccati  equation  (14)  can  be  solved  off-line.  This  allows  the  computation  of 
g(Q  off-line,  while  when  using  the  EKF  it  is  necessary  to  integrate  (9)  and 
compute  (10)  and  (11)  on-line.  In  particular  if  the  problem  can  be 
transformed  into  observer  form  with  a(y)  =  0  then  the  GANF  is  the  optimal 
filter  while  the  extended  Kalman  filter  is  generally  not. 

2-  An  Application  of  GANF 

We  consider  a  falling  object  in  an  atmosphere  of  varying  density.  The 
problem  is  to  estimate  the  position,  the  velocity  and  the  ballistic  coefficient 
of  the  object.  This  problem  has  been  discussed  previously  by  Gelb  [3]  and 
Wishner  et  al  [4], 

The  dynamic  model  consists  of: 

y  =  t. 

^“-g  +  Pttp^C,  (17) 

£3=0. 

t,(o)=q 
yo) = q 

C,(0)  =  C3° 

where  ^  is  the  vertical  position,  the  velocity,  £  the  inverse  of  the 


ballistic  coefficient  and 
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P(Ci)  =  P0exP(--jp) 


(18) 


represents  the  variation  of  the  atmospheric  density  with  the  height  (j.  It  can 
be  verified  that  this  system  is  observable  according  to  the  observability 
condition  defined  in  section  1.  In  fact 


dh  =  [>  0  °] 
dL,h=[0  1  0] 


3 

span  R  ,  except  at  the  singular  point  =  0  where,  the  object  being 

stationary,  it  is  impossible  to  observe  any  of  its  dynamics.  Hence  it  is 
possible  to  write  (17)  in  Observable  form: 

y  =  S, 

L  g  L  ,  , 

Vf3(^)“_gk  +(2t  ~k^3  +  £^3 

p  Sj  p  ^2 


(19) 


(20) 


The  Jacobian  of  the  transformation  between  C,  and  t,  is 


1 


0 


o 


p 


0 

0 

pq 


(21) 
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Unfortunately  (20)  does  not  satisfy  the  conditions  for  the  transformation  to 
observer  form,  but  it  can  be  transformed  into  the  modified  observer  form. 

If  we  let 


0\2 


5° 


<x(5)  =  - - *-=■  E,  =  £°(  2  -  — ) 

1  V  2^  -$2  S  ^  C 


H.ftJ'ffllWkX  V -."T  .) 


2  2 


°s<V“ 


p  (S/ty2 


■On  2 


(22) 


and 


xl 


X2  =  ^2 

*3=lH  +a2(y 


(23) 


then  the  system  can  be  written  in  modified  observer  form: 


y=xt 

xi  =x2-a,(y 

x2=x3-a2(y 

x3=-a(V 


(24) 


In  some  sense  this  change  of  coordinates  linearizes  the  system  "as  much  as 


possible". 

Assuming  the  model  to  be  affected  by  random  process  and  measurement 
noises,  the  corresponding  stochastic  differential  equations  become: 

dy  =  Xjdt  +  Ddv 

dXj  =  (x2-a(^))dt  +  B1dw, 

d*2  =  ( X3  -  a2(^))dt  +  B2dw2 

dxj  =  -  a3(^)dt  +  B3dw3 

with 

Xj(0)  =  N(  x3,P(l,l)) 
x2(0)  =  N(x^,P(2,2)) 
x3(0)  =  N(x5,P(3,3)) 

Q  =  BBT  r  =  ddt 

where  R  is  the  measurement  noise  covariance  and  Q(l,l),  Q(2,2),  Q(3,3) 
are  the  process  noise  variances.  P(l,l),  P(2,2),  P(3,3)  are  the  variances  of 


,A  A  , 

dy  =  Xjdt 

dxj  =  (x2  "aj(^2))dt +  K,(dy  -  dy) 
dx2  =  (x3-a2(E2))dt  +  K2(dy-dy) 
dx3  =  -a(ydt  +  K3(dy-dy) 


the  errors  in  the  initial  conditions. 
The  filter  dynamic  equations  are: 


11 

Rewriting  the  system  in  the  original  coordinates,  the  filter  equations 
become: 


dC^dt  +  K^dy-^dt) 


dC2  =  ( -g  +  p(C,  )C^3)dt  +  (^)\(dy  -  tdt) 


(27) 


C° 

A  A 

a  L  C 


r  1  A  L  A  A 

P  kpP(V(v  P(C,K^)2 


KjMdy-^dt) 


where  K^.K  , K  ,  are  the  gains  computed  from  (13)  and  (14)  with 
K=PCTR'1. 

The  Jacobian  of  the  transformation  from  physical  coordinates  to  observer 
form  is: 


1  0  0 
fe-f  0 

±  1 

kP  kpp(C°)2  P(CJ)2 


P  =  P0e 


(28) 


The  Jacobian  turns  out  to  be  close  to  a  unitary  operator.  This  is  clear  for 
the  2x2  upper  left  minor.  In  fact,  simulations  were  computed  both 
assuming  the  Jacobian  constant  evaluated  at  and  computing  its  actual 


value  on  line.  The  results  were  not  affected  in  an  appreciable  manner. 


3  -  Simulations  and  Results 

In  this  section  we  present  the  results  of  some  simulations,  run 
different  noise  regimes  and  with  the  following  constants: 
initial  conditions: 

initial  height  =  30,000  m 
initial  velocity  =  -  2,000  m/sec 
initial  inverse  ballistic  coefficient  =  1 .025 * 1 O’4  m2/kg. 
physical  parameters; 

atmospheric  decay  constant  kp=  10,000  m 
atmospheric  density  at  sea  level  pQ=  1.230  kg/m3 

gravity  acceleration  g  =  9.8 1  m  /  sec2 

Additionally,  the  results  correspond  to  the  following  noise  regime: 

measurement  noise 

R  =  100  m2sec 

process  noises 

Q(  1,1)  =  100  m2 
Q(2,2)  =  100  m2/sec2 
Q(3,3)  =  1.0*10'10m4  /  kg2 
uncertainties  on  the  initial  conditions 
P(l,l)  =  5000  m2 
P(2,2)  =  2000  m2/  sec2 
P(3,3)  =  1.0*  10'8  m4/  kg2 


The  object  falls  from  an  initial  height  of  30,000  m  with  an  initial 
downward  speed  of  2,000  m/sec.  It  has  an  initial  ballistic  coefficient  of 
9,756  kg/m^,  which  is  equivalent  to  the  object  weighing  approximately  ten 
metric  tons  per  square  meter  of  surface  perpendicular  to  the  direction  of  the 
fall. 

The  errors  in  the  initial  conditions  are,  probably,  not  unrealistic  for  a 
radar  tracking  problem.  The  process  noise  covariances  are  also  close  to 
reality  if  we  consider  the  possible  random  effects  of  changes  in  the 
conditions  of  the  atmosphere  and  wind  encountered  on  the  way  to  the 
ground  along  the  trajectory.  Finally,  the  process  noise  on  the  ballistic 
coefficient  could  be  interpreted  as  variations  of  the  shape  or  orientation  of 
the  object  during  the  fall. 

The  behavior  of  the  real  system  is  presented  in  Fig.  1,  which  portrays 
the  trajectories  of  the  three  states  affected  by  the  noises.  We  remind  the 
reader  that  the  first  state  is  the  height  of  the  object,  the  second  the  velocity 
and  the  third  the  inverse  ballistic  coefficient. 

In  Fig.  2(a)  are  shown  the  errors  between  the  estimates  of  the  height  of 
the  two  filters  and  the  height  of  the  real  system  (Fig.  1(a)).  The  EKF  and 
the  GANF,  compared  in  terms  of  performance  in  the  estimate  of  the  height, 
are  nearly  equivalent.  The  EKF  performed  slightly  better,  but  the  time 
history  of  the  error  is  nearly  identical.  Similarly,  in  Figs.  2(b)  and  2(c)  are 
shown  the  estimates  of  the  velocity  and  the  inverse  ballistic  coefficient, 
respectively.  Although  the  two  filters  showed  a  very  similar  behavior,  the 
EKF  performed  slightly  better,  the  difference  probably  being  mainly  due  to 
the  approximation  introduced  in  assuming  that  the  noise  covariances  were 
constant  in  observer  coordinates  for  the  GANF.  In  Fig.  2(d)  is  shown  the 


behavior  of  the  real  inverse  ballistic  coefficient  and  of  its  estimates  to  give 
the  reader  an  idea  of  the  initial  error  in  the  estimates  and  of  the  effect  of  the 
process  noise.  Without  process  noise  the  real  coefficient  would  be 
constant. 

Shown  in  Fig.  3(a)  is  the  logarithm  of  the  average  covariance  of  the 
error  in  the  estimates  of  the  height  for  25  Monte  Carlo  runs.  As  can  be  seen 
from  the  Figs.  3(a)  and  2(a)  the  recovery  from  errors  in  the  initial  guess  is 
very  fast  after  which  the  covariance  settles  down  to  values  close  to  process 
noise  covariance  for  the  height.  There  is  no  appreciable  difference  in  the 
behavior  of  the  two  filters.  In  Figs.  3(b)  and  3(c)  arc  shown  the  logarithms 
of  the  average  covariance,  for  25  Monte  Carlo  runs,  of  the  error  in  the 
estimates  of  the  velocity  and  the  inverse  ballistic  coefficient,  respectively. 
Again  the  two  filters  performed  very  similarly.  In  Fig.  3(c)  the  two  filters 
behaved  so  similarly  that  the  two  curves  are  almost  indistinguishable.  All 
these  results  were  checked  by  insuring  that  the  covariances  of  the  errors 
were  near  the  values  of  the  covariances  theoretically  predicted  by  the 
solution  of  the  Riccati  differential  equation. 


Conclusion: 

The  simulation  results  demonstrate  that  the  GANF  filter  performed 
practically  as  well  as  the  extended  Kalman  filter  from  the  point  of  view  the 
accuracy  of  the  estimates. 

In  terms  of  the  algebraic  efforts  in  developing  the  filter  equations,  the 
EKF  is  obviously  more  straightforward,  since  one  need  only  evaluate  a 
first  order  approximation  to  the  nonlinear  equations  along  the  trajectory. 
The  GANF,  on  the  other  hand,  relies  on  differential  geometric  concepts, 


and  require  considerably  more  difficult  algebraic  computations  off-line. 
However,  the  development  of  the  algebra  may  eventually  became  a  simple 
exercise  in  computer  programming  if  currently  popular  symbolic 
manipulation  programs  like  MACSYMA  or  SMP  are  used 

The  real  advantage  of  the  GANF  filter  over  the  extended  Kalman  filter  is 
in  its  computational  efficiency.  In  fact,  as  mentioned  previously,  we  can 
compute  the  gains  of  the  GANF  filter  off-line,  whereas  the  gains  of  the 
extended  Kalman  filter  must  be  calculated  on-line.  For  the  particular 
simulation  presented  in  this  paper,  written  in  FORTRAN  language  and 
executed  on  a  VAX  785  computer  running  under  the  VMS  operating 
system,  the  integration  of  the  GANF  filter  along  the  entire  trajectory 
required  1.06  seconds  of  CPU  time,  while  the  integration  of  the  extended 
Kalman  filter  took  3.2  seconds.  Thus  the  GANF  filter  has  performed  three 
times  faster.  With  higher  order  systems  the  computational  advantage  will 
be  further  emphasized  since  the  on-line  computational  burden  of  the 
extended  Kalman  filter  grows  as  (n2  +  3n)/2  while  that  of  the  asymptotic 
nonlinear  filter  grows  only  as  n.  In  fact,  solving  the  Riccati  differential 
equation  on-line  requires  the  integration  of  (n  +  n)/2  scalar  differential 
equations. 

List  of  figures: 

Fig.  1  Nominal  trajectory  of  the  states  of  the  falling  object  in  the  presence 
of  random  noise,  (a):  height,  (b):  velocity,  (c):  inverse  ballistic  coefficient. 
Fig.  2  Errors  of  the  extended  Kalman  filter  and  of  the  geometric  asymptotic 
nonlinear  filter  in  the  estimates  of  the  states  of  the  falling  object,  (a):  height, 
(b):  velocity,  (c):  inverse  ballistic  coefficient,  (d):  estimates  of  the  inverse 


ballistic  coefficient  together  with  the  actual  inverse  ballistic  coefficient 
Fig.  3  Variances  of  the  errors  of  the  extended  Kalman  filter  and  of  the 
geometric  asymptotic  nonlinear  filter  in  the  estimates  of  the  states  of  the 
falling  object  after  25  Monte  Carlo  runs.  Plotted  using  logarithmic  scale,  (a): 
height,  (b):  velocity,  (c):  inverse  ballistic  coefficient 
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Computation  of  Observer  Normal  Form 
Using  Macsyma * 

by  Andrew  R.  P helps  f  and  Arthur  J.  Krenerf 


1.  Introduction 

The  computation  of  linear  observers  has  become  relatively  routine,  and  computer 
packages  exist  which  make  these  computations  straightforward  and  accessible.  When  it 
comes  to  nonlinear  observers,  however,  the  picture  has  not  been  so  bright.  Algorithms  for 
this  sort  of  calculation  have  been  published  [l],  [2],  [4],  [5],  [6].  In  general,  these  are  limited 
by  one  or  more  steps  involving  difficult  computations. 

The  Ph.D.  thesis  of  Phelps  [8]  provides  a  breakthrough  in  the  nonlinear  observer  al¬ 
gorithm.  In  particular,  Lie  bracket  calculations  axe  no  longer  required  to  perform  changes 
of  state  coordinates,  and  the  computation  becomes  straightforward.  Macsyma  was  in¬ 
strumental  in  developing  the  new  approach.  A  prototype  of  this  new  algorithm  has  been 
implemented  in  Macsyma. 

We  consider  an  uncontrolled  dynamical  system  with  partial  state  observation: 

( =  m . 

»=*(«•  (i) 

The  state  space  is  in  Rn  and  the  output  space  is  in  Rp.  Generally,  this  may  be  put  in 
observable  normal  form: 


y  =  C{.  (2) 

The  problem  is  to  see  if,  in  fact,  it  supports  observer  normal  form: 

i  =  Ax  -  a(C x ) , 

y  =  Cx.  (3) 

Here  the  A,  B  and  C  are  matrices  given  in  Brunovsky  canonical  form. 
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The  algorithm  in  question  is  determined  by  the  conditions  required  for  conversion  of 
a  system  (1)  to  observer  form  (3).  This  paper  is  based  on  the  approach  in  [4]  and  [5],  as 
modified  in  [8],  These  conditions  are: 

Observable  form  Must  be  able  to  convert  system  to  observable  form  (2); 

Output  coordinate  change  Must  satisfy  d.e.  for  y  =  y(y); 

Polynomial  degree  Observable  form  polynomials  /)(£),  for  1  <  j  <  P,  (the  entries  of 
B  <*(£)  in  (2))  must  have  degree  < 

Coefficient  compatibility  Observable  form  polynomials  must  evaluate  to  certain  in¬ 
tegrals  of  differential  expressions  in  injection  terms  (the  entries  of  a(C  x)  in  (3)). 

Note.  An  earlier  version  of  these  conditions  replaces  the  coefficient  compatibility 
condition  with  the  condition  that  the  certain  brackets  vanish.  Let  qj  be  the  unit 
vector  in  the  £j.(j  direction.  All  brackets  of  elements  in  {ad'Sj1  qj  :  1  <  i  <  tj  }  must 
vanish. 

The  approach  in  [1],  [2]  and  [6],  which  is  not  used  here,  has  been  developed  only 
for  the  case  when  p  =  1  and  there  is  no  change  of  output  coordinates  required.  It 
calls  for  the  existence  of  observable  form  (2)  (but  not  its  computation)  and  replaces 
the  last  two  conditions  with  a  requirement  that  d(ade_^q)  6  span  { dh },  plus  a  slight 
technical  adjustment. 

We  will  elaborate  the  theorems  and  computations  associated  with  the  coefficient  com¬ 
patibility  condition,  which  will  obviate  the  need  for  extensive  bracket  computations. 

2.  Coefficient  Compatibility  in  Standard  Coordinates 

For  simplicity’s  sake,  we  first  describe  the  results  in  the  case  that  we  have  the  “right” 
output  y.  In  fact,  we  will  see  that  this  could  be  considered  sufficient  for  an  improved 
algorithm,  since  the  relevant  change  of  state  coordinates  will  be  entirely  determined  by 
the  tranformation  y  =  y(y)  of  the  corresponding  outputs. 

In  section  3,  however,  we  will  indicate  a  result  expressed  directly  in  observable  form 
coordinates  (£,y). 

We  may  compute  observable  form  ((£,  y)  coordinates),  relative  to  the  output  y  which 
is  the  solution  to  the  output  d.e.  ’s: 

£=A£-Z?a(0, 

y  =  c £.  (4) 


We  call  the  coordinates  (4)  standard  coordinates. 

This  computation  does  not  constitute  a  major  burden  on  our  algorithm  (given  Mac- 
syma),  since  we  only  require  an  iterated  set  of  Lie  differentiations  of  functions  and  back- 
substitutions  to  get  the  transformation  £  =  £(£). 

To  annotate  our  coordinate  systems,  we  adopt  certain  conventions.  We  describe  the 
state  variables  by  £i:j,  indicating  that  it  is  the  (j  —  l)-th  time  derivative  of  the  output 
variable  y,-,  the  same  going  for  (a;,y)  and  (£,  y)  coordinates.  The  injection  functions 
are  written  similarly.  If  p  =  1,  we  omit  the  “1  to  simplify  the  notation. 
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Furthermore,  the  coefficient 


Q-m:---(y)  ■—  •••  +  l •••«,•  :*  +  l  —  |  — (if) 

' - v - ' 

r. j  . fc  times 

v - - - - 

0) 


of  the  monomial 

<’ 1  ==  n  n  mu,  .....  (*) 

j=i  *:=i 

is  characterized  by  having  degree  ij,k  and  exponent  eJ;fc  with  respect  to  its  factor 

for  1  <  fc  <  rj  and  1  <  3  <P-  The  vertical  bars  ‘ — ’  separate  the  subscript  into  parts 

according  to  the  the  output  j  involved.  We  also  have  cumulative  indices 

ri  p  P  ri 

ej  :=  er-k  *  e:=^2  e>  and  *":=EE  er-k  • 

it=i  j=i  i=i  fc=i 

To  simphfy  the  notation,  we  also  occasionally  represent  functions  as  their  own  “0-th” 
derivatives  and  we  write  for  i  <  0,  as  a  null  symbol  indicating  a  contribution  which 

vanishes. 

The  natural  way  to  describe  the  change  from  observable  form  coordinates  £  to  ob¬ 
server  form  coordinates  x  is  to  compute  the  d.e.  ’s  which  determine  it.  These  get  quite 

dy 

complicated,  since  they  involve  the  change  of  coordinates  matrix  J  =  —  and  its  iterated 

derivatives.  The  choice  of  standard  coordinates,  however,  causes  all  these  terms  to  vanish 
and  makes  these  d.e.  ’s  accessible. 

Using,  for  convenience,  the  simplifying  assumption  that  each  observability  index  £} 
is  equal  to  some  £,  we  compute  the  equations  x  =  z(f)  governing  the  change  of  state 
coordinates: 


£j:l  —  xj:  1  5 
2  =  X>:2  “ 

(j: 3  -  Xj, 3  -  L;aj:1(y)  -  ai:2(y) , 


^-i 

~  x}  tj  ~  L  j  ay.i{y)i 

i=  1 


M)  =  -EL/ 


i=l 


(6) 


for  1  <  j  <  p,  where  /  is  /  in  the  £  coordinates. 

From  the  expansion  (6)  we  can,  in  effect,  read  off  the  a  polynomial  coefficients  in 
terms  of  the  a  injection  functions. 
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Example  2.1.  Coefficient  solutions,  for  the  case  p  =  1  and  t  =  3. 
The  expansion  (6)  gives  us: 

/i(0  =  <*23  (y)66  +  a3(y)£3 

+  a222(y)^23  +  «22(y)^22  +  <*2(y)£2 

+  ai(y). 

This  leads  to  the  coefficient  solutions: 


Oi  = 

~<*3  , 

da  2 

a2  = 

dy  ’ 

d2ai 

dii  = 

dy2 

0222  ~ 

o, 

<*23  = 

o, 

daj 

a3  = 

dy 

The  a ’s  may  be  computed  from  the  a ’s  by  a  simple  integration.  A 

The  pattern  described  in  the  above  example  is  easily  extended  to  the  case  where  we 
have  p  equal  indices. 

Theorem  2.2.  Suppose  that  all  the  indices  are  equal,  i.e.,  tj  =  £  for  1  <  j  <  p.  Then  the 
polynomial  coefficient  is  given  by 


w: 


P  ri 


ii  n  «**»**!'"■ 

j=  1  k=  1 


(7) 


The  existence  of  an  injection  vector  a(y )  compatible  with  all  the  coefficients  of  /j  (£),  as 
given  in  (7),  together  with  the  observable  form,  output  coordinate  change  and  polyno¬ 
mial  degree  conditions,  constitute  necessary  and  sufficient  conditions  for  the  existence  of 
observer  normal  form  (supposing  all  indices  equal). 


Proof 

Outline 

We  use  induction  on  £.  We  make  a  counting  argument  with  combinatorics  to  trace 
the  typical  coefficient  of  a  monomial  term. 

Body  of  Proof 

Let  m,  1  <  m  <  p,  be  fixed. 

First  of  all,  if  £  =  1,  then  (4)  simply  becomes  £m:1  =  /m(0  =  /m(y)-  Thus  am:1  = 
/m(y) =  ~am.i(y)i  where  we  take  am;]  :=  —  fm-  But  this  matches  formula  (7). 
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Note  that  the  derivation  in  example  2.1  gives  a  result  in  accordance  with  formula  (7). 
That  illustrates  the  pattern  we  use  for  our  induction. 

As  induction  hypothesis,  we  assume  that  formula  (7)  holds  for  t  =  /x.  We  target  the 
coefficients  in  the  expansion  for  —  xm:(l  in  the  case  t  =  \i  +  1  which  axe  the  source  via 
Lie  differentiation  of  the  coefficients  of  fm  we  seek  to  evaluate. 

But,  when  we  examine  the  p.d.e.  expansion  (6),  we  find  the  same  expression, 

-  y,  *r  ■ 

i=l 


for  in  the  expansion  with  t  =  //  +  1,  as  we  find  for  fm  in  the  expansion  with 

i  =  /x.  This  means  that  the  induction  assumption  will  enable  us  to  know  those  “target” 
coefficients,  which,  when  Lie  differentiated,  give  contributions  to  the  expansion  for  fm 
in  the  case  when  the  multi-index  £  is  \i  +  1.  These  coefficients  evaluate  to  nothing  but 
the  coefficients  of  the  terms  of  /m  in  the  case  (given  by  induction  assumption)  that  the 
multi-index  l  is  fx. 

Let  £*,  given  as  in  (5),  be  a  monomial  in  /m(£).  We  wish  to  determine  its  coefficient 

An  expression  which  under  Lie  differentiation  by  /  can  give  a  term  like  may  have 
two  forms. 

Case  1.  It  may  have  no  increment  in  the  exponent  eJ:1  of  £j: 2,  for 

In  this  instance,  it  will  come  from  Lie  differentiation  of  a  term  of  the  form 


By  induction  assumption,  term  (8)  has  a  numerical  coefficient  which  calculates  back 
from  our  projected  coefficient  for  £*  as 


j—  1  fc=i 


Here  we  take 


^5 _ f  Cff-.k  —  1  "b  1  ,  if  —  \  —  f-ry.k  1  > 

(  1 ,  otherwise . 


The  differentiation  process  contributes  an  extra  factor  of  eJ.  Thus,  the  partial  numerical 
contribution  from  this  term  is 

Itf.k  ■  (9) 

In  this  case,  the  “a”  part  of  the  coefficient  is  carried  through  unchanged.  Moreover,  it 
was  already  of  the  required  form,  since  en  has  not  been  altered  by  adding  and  subtracting 
1  in  (8). 


Case  2.  It  may  have  an  increment  in  the  coefficient  of  £J:2  for  j  =  rj. 

The  source  of  the  terms  of  this  type  is  partial  differentiation  of  the  “a”  coefficient  by 
yv.  This  will  give  an  additional  factor  of  £.V:2,  while  not  affecting  the  numerical  coefficient. 
The  prior  exponent  of  will  have  been  ev.\  —  1. 

Therefore,  the  monomial  term  prior  to  Lie  differentiation  was  £*  .  By  induction 

assumption,  its  numerical  contribution  was 


w\ 


j=i  jt=i 


Its  partial  numerical  contribution  is  therefore 


(10) 


since  tj:  i!  =  1  =  iJ:1  when  j  =  77. 

Note  that  in  this  case  the  ev  increments  by  1,  so  that  the  “a”  term  adjusts  as  prescribed 
by  formula  (7). 


These  two  cases  exhaust  the  possibilities.  The  “a”  coefficients  are  as  required.  And, 
combining  (9)  and  (10),  we  get  a  total  contribution  to  the  numerical  coefficient  of  a  factor 
of 


(£i>  «**)(= 

\7= 1  k=l  /  X 


-  =1, 
W 


which  is  also  as  required.  A 

In  the  general  case,  where  the  observability  indices  are  given  arbitrarily,  we  adopt  a 
recursive  method  for  calculating  the  a  coefficients  in  terms  of  the  or  injection  functions. 

We  may  take  a  system  with  arbitrary  indices  . . . ,  tp,  and  prolong  it  to  a  system  of 
dimension  p  ■  £p ,  to  which  theorem  2.2  applies.  Retracing  the  prolongation  step-by-step, 
we  can  track  the  (increasingly  complex)  form  of  the  formulas  for  the  a  coefficients.  We 
rely  on  the  following  prolongation  lemma  (for  a  proof,  see  [8]): 

Lemma  2.3  (Krener-Respondek-Phelps).  Suppose  an  uncontrolled  system,  given  in 
observable  form,  has  2  distinct  multi-indices  £2  of  multiplicities  ,  p2  and,  further,  that 
it  may  be  transformed  by  change  of  output  coordinates  y  =  y(y)  to  observer  form.  Then 
it  may  be  prolonged  to  a  system  in  observable  form,  having  multi-indices  Ai  :=  £j  -f  1  and 
A2  :=f2,  of  the  above  multiplicities.  Furthermore,  the  transformation  y  =  y(y)  and  the 
injection  function  a(-)  both  prolong  trivially  to  functions  which  will  take  the  prolonged 
system  over  into  observer  form. 


We  formulate  our  recursive  coefficient  calculation  as  follows: 


Algorithm  2.4  (Coefficient  Prolongation  Algorithm).  Suppose  we  have  solved  for 
the  coefficients  belonging  to  the  indices  £i, . . .  ,£k  and  moreover  £*  <  4  +  1  =  ...  =  £s, 
where  s  <  p.  We  may  construct  a  “ quasi-solution ”  using  the  method  of  theorem  1  applied 
to  the  prolonged  system  with  s  indices  all  equal  to  £a.  By  back-substitution  we  may  then 
express  the  prolonged  version  of  fj(£)  with  a’s  as  coefficients,  for  k  +  l<j<s.  We  may 
then  substitute  L'j £h.lh  (expressed  in  terms  of  the  a’s,  and  using  the  solutions  previously 

derived  for  1  <j<  k)  for  the  “ quasi-variables ”  £h:ti,+i  where  1  <  i  <  £a  —  £/,.  Finally,  we 
may  “read  off”  the  coefficients  of  the  monomials  thus  derived. 

We  may  now  combine  theorem  2.2  and  algorithm  2.4  to  get  the  coefficient  compatibility 
theorem  for  £  coordinates: 

Theorem  2.5.  Suppose  we  have  arbitrary  indices  £\,...,£p.  Then  we  may  derive  the 
coefficients  dm:..(y )  in  terms  of  the  a(y)  injection  functions  by  application  of  the  Coefficient 
Prolongation  Algorithm.  The  existence  of  an  injection  vector  a(y)  compatible  with  all 
the  coefficients  of  fi(£),  together  with  the  observable  form,  output  coordinate  change 
and  polynomial  degree  conditions,  constitute  necessary  and  sufficient  conditions  for  the 
existence  of  observer  normal  form. 


Proof 

This  has  already  been  done  in  theorem  2.2  for  the  case  where  all  indices  are  the  same. 
Algorithm  2.4  enables  us  to  extend  this  result  inductively  whenever  £/,  <  £h+ j.  A 

It  can  also  be  shown  that  we  can  back-solve  for  the  a  injection  functions  in  terms  of 
the  a  coefficients  by  iterated  integrations  (see  [8]). 

For  the  generic  case,  where  there  are  two  distinct  indices,  differing  by  1,  we  state  the 
formula: 

Corollary  2.6.  For  the  generic  case  of  two  different  multi-indices  of  size  A2  and  A2  :=  A2  +1 
and  multiplicities  p2  and  p2,  the  coefficient  am;...(y)  is  given  by 


title;  k\h'e>' 


ST  (  dec*i:Ai-u;(;/)  da r 

h\dyy1'-  dyrp  “ 


dyi 


7  =  1  fc=l 


deam:x2-u>(y) 
dy(l  ■  ■  ■  dx/p” 


In  particular,  am:...(y )  is  given  by  theorem  1  for  1  <  m  <  p\  and  for  degree  >  A2  when 

Pi  +  l  <  m  <  p. 


Proof 

This  is  directly  calculated  using  theorem  2.2  and  one  application  of  algorithm  2.4.  A 

To  conclude  this  section,  we  give  an  example  of  coefficients  for  the  (simplest)  non¬ 
generic  case.  Note  that  it  is  trivial  to  back-solve  for  the  derivatives  of  the  a ’s  and  integrate. 


Example  2.7.  Coefficient  solutions,  for  p  =  2,  =  1,  £2  =  3. 


—  <*2:3  , 


<*1:1  - 

-  —  <*1:1  5 

2  2  <*2:1 

dot  1:1  da2:l 

+  <*1:1 

02:1  - 

=  a*? 

ttl'  dy\  dyi 

o  32<*2:1 

<9<*1:1  0<*2-.l 

C><*2:2 

02:|2  = 

-  2  <*l;l  _  _ 

°y\  °y2 

dy2  dy\ 

dy2 

<92<*2:  i 

a2:'22  ~  "  ~dgT  ' 

da2:  i 

aj!|>  ~~~W 


<9or  2:2 
#yi 


A 


3.  Coefficient  Compatibility — General  Case 

We  have  seen  the  expansion  (6)  in  the  simplified  situation  of  standard  coordinates. 
This  can  be  converted  to  general  observable  form  coordinates  (2)  by  a  not-too-inconvenient 
calculation.  However,  if  we  have  Macsyma  or  some  other  facility  that  enables  us  to  do 
suggestive  examples  expeditiously,  we  can  find  patterns  in  the  results  that  suggest  direct 
solutions  for  the  general  version  of  (6).  For  instance,  consider: 

Example  3.1.  Coefficient  solutions,  in  general  observable  form  coordinates,  for  p  —  1, 
t  =  4. 

We  have  the  following  pattern,  which  mimics  the  result  in  theorem  2.2: 


<*1  = 

a7  — 

°23  = 

a22  = 

<*3  = 

<*222  = 

a4 


dy 

dy 

0^4  , 

dy 

da2 

dy 

dy 

3  dy  d2oti 

dy  dy 2 

dy 

d2a2 

dy 

dy2  ’ 

dy 

da2 

dy 

dy 

dy 

d3a\ 

dy 

dy 3 

dy 

da  j 

dy 

dy 

We  have  another  pattern,  which  relates  to  the  degree  t  terms  that  vanish  in  standard 
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coordinates: 


1  d^CL  24  3  d(l24  1  3 

“ 1,52  =  4  Ip-  "  16  ^  +  64  ' 

3  da2  4  3  2 

a223  ~  2  ~df  ~8a24’ 

3 

033  =  —  024  • 

4 

A 


To  describe  the  intricacies  of  the  degree  t  terms  in  example  3.1,  we  introduce  the 
following  notational  scheme.  Let  P(m)  be  the  partitions  of  m.  Write  a  partition  7r  of  e  —  1 
by 

S 

e~i  =  YJcini- 

i=i 

a 

Define  c  :=  c_,  as  the  number  of  pieces  of  the  partition. 

j=i 

With  these  annotations,  we  formulate  the  pattern  of  example  3.1  for  the  following 
theorem  on  coefficient  compatibility: 

Theorem  3.2.  A  single-output  system  has  an  observer  normal  form  iff  the  observable 
form,  output  coordinate  change  and  polynomial  degree  conditions  hold,  and  moreover  in 
observable  form  (£ )  coordinates  the  coefficient  a...(y)  equals 


k=i 

for  terms  of  degree  less  than  (,  and 

w\  (e  —  1)! 


w!  )  deat-w(y)  dy 
dye  dy 


-  £ 


n 

k=l 


Ck'-ik'ek  JJcjlnj!' 
i=  i 


(t)‘ 


i=  i 


for  terms  of  degree  equal  to  f. 

A  proof  of  this  theorem  will  appear  in  a  forthcoming  paper  of  Phelps  [7], 

Using  Lemma  2.3  and  the  above  theorem  (adjusted  to  the  case  of  p  equal  indices),  we 
may  in  principle  compute  the  general  transformation  x  —  x(£),  relating  observer  form  (3) 
to  observable  form  (2). 


4.  Conclusion 

Two  points  need  to  be  made  here. 
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First,  the  “coefficient  compatibility”  approach  to  nonlinear  observer  calculations  sim¬ 
plifies  in  principle  the  theory  and  makes  unwieldy  bracket  calculations  unnecessary. 

Second,  the  use  of  Macsyma  made  it  possible  to  do  the  rather  extended  calculations 
of  examples  that  made  the  patterns  in  the  data  stand  out.  Every  aspect  of  the  algorithms 
for  nonlinear  observer  calculation  is  readily  accessible  to  Macsyma  programming,  and 
converting  the  algorithm  from  its  abstract  form  of  “algorithm-in-principle”  to  a  concrete 
“algorithm-in-fact”  is  naturally  done  in  this  milieu. 
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THE  STRUCTURE  OF  SMALL-TIME  REACHABLE  SETS  IN 
LOW  DIMENSIONS* 

ARTHUR  J.  KRENERt  and  HEINZ  SCHATTLER+ 

Abstract.  This  paper  outlines  a  general  method  to  determine  the  geometric  structure  of  small-time 
reachable  sets  for  a  single-input  control  system  with  a  bounded  linear  control.  The  authors'  analysis  relies 
on  free  nilpotent  systems  as  a  guide,  and  hence  their  techniques  only  apply  to  nondegenerate  situations. 
The  paper  illustrates  the  effectiveness  of  the  method  in  low  dimensions.  Among  other  results  is  given  a 
precise  description  of  the  small-time  reachable  set  for  a  system  x  =  f(x)  +  g(x)u.  |u|§  I  in  dimension  four, 
under  the  generic  assumption  that  the  constant  controls  u  =  +1  and  u  *  -1  are  not  singular.  As  a  corollary, 
a  local  synthesis  is  obtained  in  dimension  three  for  the  time-optimal  control  problem  under  the  analogous 
generic  condition. 

Key  words,  nonlinear  systems,  nilpotent  approximation,  reachable  sets,  bang-bang  trajectories,  singular 

arcs 


AMS(MOS)  subject  classifications.  49B10,  93BI0 


1.  Introduction.  In  this  paper  we  study  the  qualitative  structure  of  small-time 
reachable  sets  in  low  dimensions  for  a  single-input  system  with  a  bounded  linear 
control.  More  precisely,  we  consider  a  system  of  the  form 

(1)  I:  x=/(x)  +  g(x)u,  |u|£l,  JceR" 

where  /  and  g  are  smooth  (C°°)  or  analytic  vector  fields  and  admissible  controls  are 
measurable  functions  with  values  in  [-1, 1]  almost  everywhere.  A  trajectory  of  the 
system  corresponding  to  a  control  u(-)  is  an  absolutely  continuous  curve  x(  • )  such 
that  x (t)  —  f(x(t))  +  g(x(t))u(t)  almost  everywhere.  We  say  a  point  q  is  reachable  from 
a  point  p  within  time  T  if  and  only  if  there  exists  a  trajectory  x(-)  defined  on  an 
interval  [0,  t],  fS  T,  such  that  x(0)  =  p  and  x(t)  =  q.  The  set  of  all  such  points  q  is 
denoted  by  Reach  (p,  tg  T);  Reach  {p,  T)  denotes  the  set  of  points  that  are  reachable 
exactly  at  time  T.  The  reachable  set  from  p ,  Reach  (p),  is  the  set  of  all  points  that  are 
reachable  from  p  within  some  time  T. 

Reachable  sets  play  an  important  role  in  control  theory.  If  a  system  can  be  stabilized 
to  a  given  point  by  a  feedback  control  law,  then  that  point  must  be  in  the  reachable 
set  of  every  other  point.  In  optimal  control  problems,  if  the  cost  is  added  as  another 
coordinate,  then  the  optimal  trajectories  must  lie  in  the  boundary  of  the  set  of  reachable 
points.  For  this  reason  the  Pontryagin  Maximum  Principle  plays  an  important  role  in 
studying  the  boundaries  of  reachable  sets. 

The  problem  of  describing  a  reachable  set  and  the  extremal  trajectories  that 
generate  its  boundary  is  closely  related  to  the  problem  of  regular  synthesis  in  the  sense 
of  Boltyansky  [1]  and  others  [5],  [18].  While  the  problem  has  been  studied  extensively 
for  many  years,  only  a  few  examples  of  regular  syntheses  have  been  described,  for 
instance,  [24],  Even  in  low  dimensions,  the  reachable  set  of  a  general  control  system 
can  be  extremely  complicated. 
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We  shall  attempt  to  avoid  this  difficulty  by  considering  only  “nondegenerate” 
systems.  By  a  nondegenerate  system  we  mean  one  where  (i)  f  g,  and  the  low-order 
Lie  brackets  of  /  and  g  span  as  many  dimensions  as  is  possible  given  the  dimensions 
of  the  state  space;  and  where  (ii)  no  nontrivial  equality  relations  hold  between  those 
vector  fields  (for  instance,  if  n  is  the  space  dimension,  then  any  relation  saying  that 
n  vector  fields  are  dependent  at  a  point  is  considered  a  nontrivial  equality  relation, 
whereas  a  relation  that  simply  expresses  the  fact  that  a  vector  field  can  be  written  in 
terms  of  a  basis  is  considered  trivial). 

This  is  in  the  spirit  of  Lobry  [14],  who  described  the  small-time  reachable  set  of 
(1)  in  dimension  three  under  the  assumption  that  f  g,  and  [/,  g]  are  linearly  indepen¬ 
dent.  The  method  described  below  is  an  attempt  to  extend  Lobry’s  result  to  higher 
dimensions.  As  will  be  seen,  it  is  successful  in  the  four-dimensional  case,  but  in 
higher-dimensional  cases  obstacles  still  have  to  be  overcome.  These  obstacles,  however, 
are  not  due  to  our  general  approach,  but  they  lie  in  the  fact  that,  at  the  moment,  too 
little  is  known  about  the  structure  of  extremal  trajectories.  We  shall  return  to  this 
question  at  the  end  of  the  paper.  In  the  paper  we  shall  give  a  precise  description  of 
the  small-time  reachable  set  in  dimension  four  assuming  that  the  constant  controls 
u  =  +1  and  u  =  -1  are  not  singular  on  the  boundary  of  the  reachable  set.  It  can  easily 
be  seen  (cf.  §  4)  that  this  is  equivalent  to  an  independence  assumption  on  the  vector 
fields  f  g,  [ f  g],  and  [/+g,  [/,  g]],  respectively,  [/-g,  [ f  g]].  As  a  corollary  we  are 
able  to  improve  on  recent  results  of  Bressan  [4],  Schattler  [17],  and  Sussmann  [21] 
on  time-optimal  control  in  dimension  three. 

Throughout  this  paper  we  will  use  nilpotent  systems  as  a  guide  to  the  general 
situation.  A  system  is  nilpotent  of  order  k  if  all  brackets  of  orders  greater  than  k  vanish 
and  if  k  is  the  smallest  integer  with  this  property.  In  a  certain  sense  these  systems  play 
the  same  role  as  the  polynomials  do  within  the  class  of  smooth  functions.  Nilpotent 
systems  are  the  low-order  part  of  the  coordinate  free  Taylor  series  expansion  of  a 
general  system. 

To  be  more  precise,  we  must  define  the  Lie  jet  of  system  (1).  At  a  point  p  the  Lie 
jet  consists  of  a  list  of  the  values  at  p  of  the  Lie  brackets  of  /  and  g  written  down  in 
some  prescribed  order.  Of  course,  because  of  the  skew-symmetry  and  Jacobi  relation 

[/  g] +  [g, n  =  0,  [/,  [g,h]]  +  [g,  [/»,/]]  +  [/.,  [/,  g]]  - 0, 

we  need  only  consider  a  list  of  distinct  brackets.  These  brackets  can  be  partially  ordered 
by  the  total  number  of  vector  fields  involved;  for  example,  /  is  a  bracket  of  order  one 
and  [/  g]  is  of  order  two.  The  Lie  jet  of  order  k  is  a  list  of  values  at  p  of  the  distinct 
brackets  of  /  and  g  of  order  less  than  or  equal  to  k.  The  Lie  jets  of  orders  one  through 
four  are  given  below: 


Order  one:  {f(p),g(p)}. 

Order  two:  {/(/>),  g(p),  [f  g](p)}. 

Order  three:  {/(p),  g(p),  [/,  g](p),  [/,  [/,  g]](p),  [g,  [/  g]](p)}. 

Order  four:  {/(p),  g(p),  [/,  g](  p),  [/  [f,  g]](p),  [g,  [/,  g]](  p), 

U,  U,  If  g]]](p),  [f  [g,  [/,  p]]](p),  [g,  [g,  [/,  g]]](p)K 


If  N(k)  is  the  number  of  distinct  brackets  of  /  and  g  of  order  k  or  less,  then  the 
kth-order  Lie  jet  of  (l)  at  p  is  a  point  in  the  vector  bundle  consisting  of  the  Whitney 
sum  of  N{k)  copies  of  the  tangent  bundle. 

A  basic  result  of  Krener  [12],  later  proved  in  other  contexts  by  Rothschild  and 
Stein  [15],  Hermes  [10],  Crouch  [8],  Bressan  [3],  and  Sussmann  [20],  [21]  is  that  for 
analytic  systems  of  the  form  (1),  the  fcth-order  Lie  jet  at  p  determines  the  trajectories 
emanating  from  p  up  to  order  0(tk+l)  and  up  to  diffeomorphisms  of  the  state  space. 
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Sussmann  [22],  [23],  Bressan  [4],  and  Schattler  [16],  [17]  have  shown  that  the 
local  structure  of  time-optimal  controls  in  dimension  two  or  three  is  determined  in 
nondegenerate  situations  by  the  second,  respectively,  third-order  Lie  jet  at  a  reference 
point.  In  degenerate  situations  higher-order  jets  need  to  be  considered  [16],  [17],  [23]. 

On  the  basis  of  these  results  we  might  conjecture  that  in  nondegenerate  situations 
the  kth-order  Lie  jet  at  p  determines  the  structure  of  the  set  of  small-time  reachable 
points  where  the  Hormander  or  controllability  condition  is  satisfied,  i.e.,  the  rank  of 
the  Ath-order  Lie  jet  at  p  equals  the  dimension  of  the  state  space.  And  maybe  the 
qualitative  structure  of  the  reachable  set  can  be  obtained  by  looking  at  a  Ath-order 
nilpotent  approximation.  Unfortunately,  as  we  mention  in  the  last  section,  these 
conjectures  are  not  completely  true,  but  they  do  motivate  much  of.  our  work. 

The  paper  is  organized  as  follows.  The  next  section  reviews  the  Pontryagin 
Maximum  Principle  as  applied  to  the  system  { 1 ).  This  also  gives  us  a  chance  to  introduce 
some  notation  and  terminology.  In  §  3,  we  will  describe  the  main  ideas  and  outline 
the  general  structure  of  our  techniques  by  looking  at  the  trivial  two-dimensional  case. 
We  will  also  give  a  brief  proof  of  Lobry’s  three-dimensional  result.  The  main  part  of 
the  paper  is  §  4,  where  we  determine  the  geometric  structure  of  the  small-time  reachable 
set  for  the  nondegenerate  four-dimensional  system  (assuming  that  both  quadruples 
(/,  g ,  [/.  g],  U+g,  [/,  g]])  and  (/,  g,[f  g],  [/“  g,  [/,  g]])  consist  of  independent  vectors 
at  p).  We  also  draw  the  obvious  corollaries  about  time-optimal  control  in  dimension 
three.  Section  5  concludes  with  a  brief  discussion  of  the  free  nilpotent  five-dimensional 
system  and  explains  why  the  general  nondegenerate  five-dimensional  case  is  different 
from  this  one. 

2.  The  maximum  principle.  The  Maximum  Principle  [13]  gives  necessary  condi¬ 
tions  for  a  point  to  lie  on  the  boundary  of  the  reachable  set.  Let  u(  • )  be  an  admissible 
control  defined  on  an  interval  [0,  T]  and  let  x(  • )  be  the  corresponding  trajectory 
starting  at  p.  If  x(T)ed  Reach  ( p ),  then  x(t)ed  Reach  ( p )  for  all  /e[0,  T]  and  there 
exists  an  absolutely  continuous  curve  A  :[0,  7~]-»IR",  which  does  not  vanish  anywhere 
such  that 

(2)  A (t)T  «  -A(/)T(Df(x(t))+  Dg(x(t))  ■  u(r)), 

(3)  (A ( f),  g(x(l)))u(f)  =  Min  (A(t),  g(x(t)))v. 

Iris  i 

(4)  H  =  (\(t)J(x(t))  +  g(x(t))u(t))  =  0 

almost  everywhere  on  [0,  7*].  (We  write  vectors  as  columns,  (• ,  ■)  denotes  the  standard 
Euclidean  inner  product  on  R",  and  Df  and  Dg  denote  the  Jacobian  matrices  of  /  and 
g,  respectively.)  Any  trajectory  for  which  an  adjoint  variable  A(-)  exists  such  that 
(2)-(4)  are  satisfied  is  called  an  extremal  trajectory.  The  optimality  condition  (3) 
determines  the  control  u({)  whenever  4>{t)  (A(f),  g(x(t))}  r*  0;  </>  is  called  the  switch¬ 

ing  function  and  u  =  - 1  (u  =  +l)  on  intervals  where  <b  is  positive  (negative).  Trajec¬ 
tories  corresponding  to  these  constant  controls  are  called  bang  arcs  and  are  denoted 
by  X  ( =f-g )  and  Y  (  =  /+g),  respectively.  A  concatenation  of  bang  arcs  is  a 
bang-bang  trajectory.  Observe  that  (A(/),/(x(r)))  =  0  at  switching  times  /,  i.e.,  where 
(A (t),  g(x(r)))  =  0.  At  these  times  (3)  gives  no  information  about  the  optimal  control. 
If,  however,  4>  vanishes  on  an  open  interval  /,  then  all  the  derivatives  of  <t>  also  vanish 
on  1  and  this  may  determine  the  control  u.  We  have 

<M/)  =  (A(/),[/,g](x(0)>, 

<M0=(A(r),[/+gu,[/,  g]](x(r)». 
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and  if  < A  ( / ),  [g,  [f  g]](x(f))>  does  not  vanish  on  /,  we  can  solve  for  u  in  4>  =  0as  follows: 

<A (!),[/[/  g]](x(t))) 

<A(D,[g,[/,g]](x(0)>' 

A  control  of  this  type  is  called  singular  and  the  corresponding  trajectory  is  a 
singular  arc. 

This  suggests  that  concatenations  of  bang  and  singular  arcs  are  the  natural 
candidates  for  trajectories  in  the  boundary  of  the  reachable  set  (but  of  course  no  such 
regularity  statement  can  be  drawn  from  the  Maximum  Principle  alone).  We  denote 
concatenations  of  bang  and  singular  arcs  by  the  corresponding  letter  sequence;  for 
instance,  we  simply  write  XSY  for  a  concatenation  of  an  X-arc,  followed  by  a  singular 
arc  and  a  y-trajectory,  etc. 

3.  The  main  ideas  of  the  technique:  the  nondegenerate  two-  and  three-dimensional 
cases.  In  this  section  we  analyze  the  (well-known)  structure  of  small-time  reachable 
sets  in  a  nondegenerate  situation  in  dimensions  two  and  three.  These  cases  are  easy 
and  give  us  an  opportunity  to  outline  the  general  ideas  of  our  technique  without  getting 
preoccupied  with  technical  details. 

Suppose  2  is  a  system  of  the  form  (1)  in  dimension  two  and  assume  that  /  and 
g  are  independent  at  a  reference  point  p  (see  Fig.  1).  It  is  clear  how  the  small-time 
reachable  set  from  p  will  look.  If  we  let  (respectively,  T“)  be  the  integral  curves 
of  the  vector  fields  f+g  (respectively,  f-g)  for  positive  times,  then  for  sufficiently 
small  T,  Reach  (p,  ST)  is  the  union  of  T*,  f,  and  the  open  sector  R  between  T"-  and 
r  into  which  f(p)  points.  It  is  easy  to  see  that  any  point  in  R  is  reachable  from  p\ 
for  instance,  if  q  e  R,  just  run  a  trajectory  of  2  corresponding  to  the  control  u  =  +l 
backward  in  time  until  it  hits  T".  The  important  point  is  that  this  is  all  of  the  small-time 
reachable  set.  This  follows  immediately  from  the  Maximum  Principle  since  only 
trajectories  corresponding  to  the  constant  controls  u  =  +l  or  ua-1  can  lie  in  the 
boundary  of  the  reachable  set.  (There  cannot  be  a  junction,  since  then  both 
(A ( r), /( jc( r ))>  and  (A(r),  g(x(r)))  vanish,  contradicting  the  nontriviality  of  A.) 


Generalized  to  higher  dimensions,  the  quintessence  of  this  argument  is  to  have 
two  hypersurfaces  T*  and  T*  which  are  generated  by  extremal  trajectories,  have  a 
common  relative  boundary  and  “enclose"  a  region  R.  Then,  to  prove  that  R  is  actually 
the  reachable  set  Reach  (p,  ST),  we  must  show  (i)  trajectories  cannot  leave  R  through 
r*  or  T*,  and  (ii)  all  points  in  the  sector  are  reachable.  The  latter  is  immediate  if  we 
have  a  drift  vector  field  /  with  f(p)  ^  0.  This  is  exactly  the  same  argument  as  in  the 
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two-dimensional  case.  Take  any  point  q  inside  R  and  run  a  trajectory  of  X  corresponding 
to  the  control  u^O  (or  for  that  matter  corresponding  to  any  control)  backward  in 
time.  Since  f{p)^  0,  this  trajectory  will  hit  T*  or  T*.  So  basically  (i)  must  be  checked; 
this  is  mostly  a  matter  of  computing  tangent  spaces,  as  will  be  shown  below.  This  is 
the  general  strategy  of  our  technique. 

All  technical  issues  left  aside  for  a  moment,  the  key  question  is  how  to  come  up 
with  the  surfaces  T*  and  T*.  We  propose  an  inductive  procedure.  Let  us  explain  it  at 
the  next  step,  which  is  the  case  of  a  three-dimensional  system  X,  where  we  assume 
that  /,  g,  and  [f,  g]  are  independent  at  a  reference  point  p.  (This  is  the  example 
considered  by  Lobry  [14].) 

Choose  coordinates  x  =  (x, ,  x2,  x3)  such  that  (dx,  ( f(p ),  g{p),  [f,  g](p)>  =  Id,  the 
identity  matrix.  The  projection  of  X  into  the  (x, ,  x2)-plane  is  then  the  two-dimensional 
system  considered  above  and  we  know  the  structure  of  its  small-time  reachable  set. 
Our  aim  is  to  find  two  hypersurfaces  T*  and  T*  consisting  of  extremal  trajectories  that 
project  onto  the  reachable  set  R  of  the  two-dimensional  system  in  dimension  three.  If 
T*  and  T*  have  a  common  relative  boundary  that  projects  onto  SR  and  if  T*  and  T* 
do  not  intersect  in  their  relative  interior,  then  it  is  clear  that  these  surfaces  “enclose” 
a  region  R.  Then  we  must  check  whether  trajectories  can  leave  R.  If  this  is  impossible, 
R  is  the  small-time  reachable  set. 

The  Maximum  Principle  gives  preliminary  information  about  T*  and  T+  because 
it  describes  necessary  conditions  for  trajectories  to  lie  in  the  boundary  of  the  reachable 
set.  In  this  three-dimensional  case  it  actually  determines  T*  and  T*  precisely,  but  in 
higher  dimensions  this  is  no  longer  true.  It  is  then  that  we  will  use  nilpotent  systems 
as  our  guide  to  find  candidates  for  T*  and  T*.  More  on  that  appears  in  §  4. 

Now  that  we  have  outlined  the  general  approach,  let  us  also  illustrate  the  basic 
technical  arguments  by  reproving  Lobry’s  result.  It  follows  from  the  Maximum  Principle 
that  all  trajectories  that  lie  on  the  boundary  of  the  reachable  set  are  bang-bang.  For, 
if  the  switching  function  vanishes  at  some  t,  i.e.,  if  (A ( r ),  g(x(r)))  =  0,  then  also 
(A(t),/(x(/)))  =  0,  and  hence  )  =  (A(/},  [/  g](x(/))>  cannot  vanish  by  the  indepen¬ 

dence  of  f,  g,  and  (/,  g]  and  the  nontriviality  of  A.  For  dimensionality  reasons  it  is 
therefore  reasonable  to  consider  the  following  two  surfaces  as  candidates  for  T*  and  T*: 

T*  =  {p  exp  (*,(/-  g))  exp  ( s2(f+  g)):  s,  §  0,  s,  +  s2  small}, 

r*  =  {pexp  (f,(/+  g))  exp  (t2(f~  g)):  §  0,  r,  + 12  small}. 

We  write  flows  of  vector  fields  as  exponentials  and  we  let  the  diffeomorphisms  act  on 
the  right,  i.e.,  p  exp  ( tf)  denotes  the  point  obtained  by  following  the  integral  curve  of 
/  that  passes  through  p  at  time  zero  for  t  units  of  time. 

It  is  clear  that  T*  and  T*  are  two-dimensional  surfaces  with  boundary.  In  both 
cases  the  boundary  consists  of  the  two  curves  corresponding  to  the  trajectories  of  f  +  g 
and  f—g  and  the  point  p.  Furthermore,  by  the  Campbell-Hausdorff  formula  [11] 

P  exp  ( J,(/-  g))  exp  (s2(f+g)) 

=  pexp((st  +  s2)/+(s2-s,)g  +  s,s2[/,g]  +  s1s2  •  O(T)), 

p  exp  ((f,(/+g))  exp  (f2(/-g))  =  p  exp  (f,  +  t2)f+  (/,  -  f2)g  -  M2[/,  g]  +  M2  •  0(7)) 

where  0(7)  stands  for  terms  that  are  linear  in  the  total  time  7.  This  shows  that  T* 
and  T*  do  not  intersect  in  their  relative  interior.  So  T*  and  T*  enclose  a  region  R. 

To  prove  that  the  enclosed  sector  R  is  the  small-time  reachable  set  we  must  show 
that  there  cannot  be  any  other  points  in  the  reachable  set.  As  in  the  two-dimensional 
case  we  have  two  options:  either  we  show  that  we  have  exhausted  all  trajectories  that 
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possibly  can  lie  on  the  boundary  of  the  reachable  set,  or  we  show  that  trajectories 
starting  at  points  on  I”*,  T* ,  T+,  or  T_  cannot  leave  R  U  T*  U  T*  U  T+  U  T_ .  As  it  turns 
out,  this  is  the  same  argument,  only  viewed  differently. 

Let  us  first  show  that  we  have  exhausted  all  possible  trajectories  that  can  lie  in 
the  boundary  of  the  small-time  reachable  set,  i.e.,  that  such  a  trajectory  is  bang-bang 
with  at  most  one  switching.  Let  y  be  a  bang-bang  trajectory  with  two  switches,  say  of 
the  form  XYX ,  with  junctions  p0  and  p,  at  times  t0<tt.  If  A  =  A(f,),  then  we  have 
(A,  g(pi))  =  0  and  (A,/(p,)>  =  0.  Also  (A(t0),  g(po))  =  0  or,  equivalently,  if  we  move  g 
ahead  along  the  flow  of  the  vector  field  Y  we  get  (A,  exp  ( — (r,  -  f0)  ad  Y)X(p0))  =  0. 
But  A  f*  0  and  so  these  three  vectors  are  dependent:  p0  and  p,  are  conjugate  points 
(Sussmann  [22]).  Therefore 

AT(p,)a  Y(p,)  a  exp  (-At  ad  Y)X(po)  =  0 

i.e.,  X(p,)  a  Y(p,)  a[X,  Y](p,)  +  O(At)  =  0,  where  At  =  tt-t0.  But  such  a  relation 
cannot  hold  in  small  time  by  the  independence  of  X,  Y,  and  [X,  Y-].  Similarly  it  follows 
that  YXY- concatenations  cannot  satisfy  the  Maximum  Principle. 

This  computation  can  also  be  viewed  in  the  following  way.  Define  a  map 
F:  (/,,  t2,  t3)>-*p  exp  (t,X)  exp  (t2  Y)  exp  (t3X)  for  f,  small.  Then  this  map  has  full 
rank  if  t(>0.  For,  if  we  compute  the  tangent  space  to  the  image,  but  pull  back  to 
p  exp  (r,X)  exp  ( t2  Y),  we  get  exactly  the  vectors  exp  ( -t2  ad  Y)  X,  Y,  and  X.  Therefore 
F(t , ,  t2 ,  r3)  is  an  interior  point  of  the  reachable  set  Finally,  if  we  pull  back  the  tangent 
space  one  step  further  to  p  exp  (fiX)  we  have  the  vectors  X,  Y,  and  exp  (t2  ad  Y)X  = 
X  -  t2[X,  Y]  +  0(/2).  The  minus  sign  at  [X,  Y]  implies  that  X-trajectories  point  inside 
R  at  points  on  T*.  Similarly,  it  follows  that  Y- trajectories  steer  the  system  into  R  from 
T*.  And  this  proves  that  trajectories  of  the  system  cannot  leave  R  through  T*.  T*,  T+, 
or  T_.  (Because  of  the  Maximum  Principle  we  can  restrict  ourselves  to  just  looking 
at  these  regular  controls  instead  of  having  to  consider  arbitrary  measurable  functions. 
For,  if  any  trajectory  would  leave  R,  then  there  will  also  have  to  be  additional  trajectories 
lying  on  the  boundary  of  the  reachable  set  and  these  must  be  bang-bang.) 

The  structure  of  the  small-time  reachable  set  as  a  stratified  set  can  easily  be 
described  using  the  following  notation.  For  neM  let 

S„  _  :=  {p  exp  (r,X)  exp  (s2  Y)  exp  (r3X) 

•  •  •  exp  ( snB ):  s,  >  0,  B  =  X  if  n  is  odd,  B  =  Y  if  n  is  even}, 

Sn,+  '■=  { p  exp  ( f ,  Y)  exp  ( f2X )  exp  ( f3  Y) 

•  •  •  exp  ( t„B ):  t,  >  0,  B  =  X  of  n  is  even  B  =  Y  if  n  is  odd}. 

In  a  nondegenerate  situation  each  of  the  S„  ±  is  a  n-dimensional  smooth  manifold. 
(Certainly  this  will  be  true  in  all  the  cases  we  consider  here.)  In  the  three-dimensional 
case  the  boundary  of  the  small-time  reachable  set  consists  of  the  two  two-dimensional 


126 


A.  J.  KRENER  AND  H.  SCHATTLER 


strata  S2 .*  which  have  in  their  boundary  the  two  one-dimensional  strata  S,  *  and  the 
zero-dimensional  stratum  S0  =  {p}.  50  also  lies  in  the  boundary  of  S,.t.  If  we  restrict 
the  total  time  to  be  gf  we  must  make  the  obvious  adjustments.  In  particular,  we  must 
add  the  strata  5„.±  :=  S„  *  H  Reach  (p,  T)  for  n  =  1,2. 

4.  The  nondegenerate  four-dimensional  systems.  In  this  section  we  determine  the 
geometric  structure  of  the  small-time  reachable  sets  from  a  point  p  for  a  system  X  of 
the  form  (1)  in  dimension  four,  where  we  assume  that  the  constant  controls  u^+l 
and  u  3=  —  1  are  not  singular.  These  conditions  can  easily  be  expressed  in  terms  of 
independence  assumptions  on  f  g,  and  lower-order  brackets  of /  and  g.  For,  a  constant 
control  u  =  u°  is  singular  on  an  interval  /  if  and  only  if  there  exists  an  adjoint  multiplier 
A  such  that  (A,/),  (A,  g>,  (A,  [/ g]),  and  <A[/+  gu°,  [/,  g]]>  vanish  identically  on  /.  By 
the  nontriviality  of  A  this  is  impossible  if  f,  g,  [  f  g],  and  [/+gu°,  [/,  g]]  are  indepen¬ 
dent.  Therefore  in  terms  of  the  vector  fields  X  and  X  our  conditions  are  equivalent  to 

(A)  X,  Y,  [X,  Y ]  and  [X,  [X,  Y]]  are  independent  near  p; 

(B)  X,  Y,  [X,  X]  and  [  X,  [X,  V]]  are  independent  near  p. 

If  we  write  [X,  [X,  X]]  as  a  linear  combination  of  X,  V,  [X,  X]  and  [  Y,  [X,  X]]  as 
[X,  [X,  X]]  =  aX  +  /3Y  +  y[X,  X]  +  S[X,[X,  Xj], 
then  (A)  is  equivalent  to  5  5*  0. 

The  cases  5>  0  and  S<0  are  significantly  different:  if  5>0  only  bang-bang 
trajectories  can  lie  in  the  boundary  of  the  reachable  set,  if  5  <  0  singular  arcs  are 
possible.  Intuitively  this  is  clear.  If  u  is  singular  on  an  interval  /,  then  (omitting  the 
arguments  t  and  x{t)) 

<fi  =  (K[f+gu,[fg]]) 

=  i<A,(l-u)[X,[X,  X]]  +  (l  +  m)[  X,  [X,  X]]) 

=  J((l  -  u)5  +  (1  +  «))  •  (A,  [  X[X,  X]]>  ft  0 

and  so  u  =  (5  + l)/(5 - 1).  This  is  an  admissible  control  only  if  5^0.  Note  that  the 
singular  vector  field  is  given  in  feedback  form  as 


S-/+— g 

5-1  * 


1-5 


-5 

*+ — r  Y, 

1-5 


5  <  0. 


4.1.  The  totally  bang-bang  case:  5>0.  This  is  the  generalization  of  Lobry’s 
example  to  dimension  four.  We  treat  only  the  general  case  here,  but  we  remark  that 
the  structure  of  the  small-time  reachable  set  is  the  same  as  for  a  niipotent  system  where 
f  g.  [/»gL  ar>d  [/[/.  g]]  form  a  basis  and  all  other  brackets  vanish.  In  appropriate 
coordinates  the  latter  system  is  linear. 

The  key  observation  again  is  that  the  Maximum  Principle  precisely  determines 
the  possible  trajectories  that  can  lie  in  the  boundary  of  the  small-time  reachable  set. 

Lemma  1.  If  y  is  a  trajectory  that  lies  in  the  boundary  of  the  small-time  reachable 
set,  then  y  is  bang-bang  with  at  most  two  switches. 

Proof.  We  first  exclude  bang-bang  trajectories  with  more  switches.  Let  y  be  a 
XXXX-trajectory  with  switching  points  p, ,  p2,  and  p-,  and  let  s, ,  s:,  s,,  s4  be  the  length 
of  the  times  along  the  respective  X-arcs  or  X-arcs.  At  every  junction  we  have 
(A,  X ( p, ))  =  0  and  (A,  X(pj))  =  0.  This  gives  rise  to.  four  conditions  on  A. 
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If  A  is  the  value  of  the  adjoint  vector  at  the  switching  time  at  p2,  we  have 

a,x(p2))=a,  y(P2))= o, 

(A,  exp  (~s2  ad  y)X(p,)>  =  0, 


(A,  exp  (s3ad  X)y(p3)>  =  0. 

Again,  the  nontriviality  of  A  implies  that  these  four  vectors  are  dependent  (“conjugate 
points”).  So  we  get  (dividing  out  s2  and  s3) 


0  =  X  a  Ka 


'exp  (s3  ad  X)  - 


I)Va(2E<Z£L 


— s2  ad  y)-l 


(5)  =  x  a  y  a  [x,  y]+~s,[x,[x,  v]]+o(s2)a-[x,  y]+^a[y,[x,  y]]+o(rs) 


=-o-(s2,s3)(^a  yA[x,  y]A[y,[x,  y]]),PJ 

where  T  is  the  total  time  along  y  and  0(T2)  stands  for  terms  that  are  quadratic  in  T; 
o-  is  a  smooth  function  of  s2  and  s3.  If  we  express  [X,  [X,  y]]  in  terms  of  X,  Y,  [X,  y], 
and  [  y,  [X,  y]],  we  see  that 

(6)  <r(s2,  s3)  =  s2  +  s}8  +  0(T2) 

where  5  is  evaluated  at  p2.  In  a  sufficiently  small  neighborhood  of  p,  8  is  bounded 
away  from  zero  and  so  the  linear  terms  dominate  quadratic  remainders  in  small  time. 
Hence  cr(s2,  s3)  is  positive  for  s,  small;  in  particular,  it  cannot  vanish,  a  contradiction. 

Analogously,  if  y  is  a  Xyxy-concatenation  with  switching  points  q,,  q2,  and  q3 
and  if  t,,  t2,  t},  r4  are  the  times  along  the  respective  trajectories,  then  we  get 

o- x  A  y » (exp (~^d x)~')  Y A (exP 


where 


= -t( h ,  f3)(x  A  y  A  [X,  Y]  A  [  y,  [X,  y]])|,2 


t(^2>  h)  ~  ~ h  t28  +  0(T2) 


is  a  smooth  function  of  f2  and  f3  near  the  origin.  Again,  since  5  is  bounded  away  from 
zero  near  p  this  function  is  negative  for  small  times,  a  contradiction. 


Fig.  3 
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It  now  follows  that,  in  fact,  any  trajectory  that  lies  in  the  boundary  of  the  small-time 
reachable  set  is  bang-bang.  This  is  an  easy  but  slightly  technical  argument.  We  will 
do  it  here  rigorously  since  we  will  need  the  computations  later  on  anyway.  The  point 
is  that  we  do  not  have  a  priori  knowledge  about  regularity  properties  of  the  controls, 
e.g.,  that  they  are  piecewise  constant.  This  is  the  case  if  and  only  if  the  zero  set 
of  the  switching  function  0  is  finite.  If  it  were  infinite,  then  the  set  N^,  of  limit  points 
of  Z(<j>)  would  be  nonempty.  In  fact,  it  is  a  closed,  nowhere  dense,  perfect  set.  (If 
/,  <  t2  are  points  in  N(0)  then,  since  0  cannot  vanish  identically,  0  is  different  from 
zero  somewhere  in  (/, ,  t2)  and  by  continuity  it  is  different  from  zero  on  a  whole  interval. 
It  is  perfect,  i.e.,  every  point  re  N(<j>)  is  a  limit  point  of  points  t„  e  N(0),  t„  ^  r,  since 
N(<f>)  cannot  have  isolated  points.  We  can  see  that  this  is  so,  since  we  know  already 
that  bang-bang  trajectories  with  more  than  three  switchings  do  not  lie  in  the  boundary 
of  the  small-time  reachable  set!)  Suppose  f,  <  t2  are  times  in  N(0).  There  exists 
a  ?€(/,,  r2)  such  that  0(0^0.  Let  t,  :=  sup  ([/, ,  f]fl  N(0))  and  let  f2:= 
inf  ([  T, ,  r2]  D  N(0)).  Then  f,  <  f2,  F,  e  N(0),  and  Z(0)  D  [T, ,  F2]  is  finite.  This  implies 
that  y  contains  subarcs  of  the  form  *B •  and  -B*,  where  B  denotes  a  bang  arc  ( X  or 
Y ),  •  stands  for  any  switching,  and  *  stands  for  a  junction  in  N(cf>).  Observe  that 
0(f)  =  0  if  re  N(0).  We  will  now  show  that  none  of  these  concatenations  can  lie  in 
the  boundary  of  the  reachable  set  and  this  will  prove  the  lemma. 

Without  loss  of  generality  we  consider  a  concatenation  of  the  form  *X-  with 
switching  points  p0  and  p,  and  let  t  be  the  time  along  X.  Then,  if  A  is  the  value  of  the 
adjoint  vector  at  the  switching  time  corresponding  to  p0,  we  have 

<A,X(p0))  =  <A,  Y(p0)>  =  <A,[X,  Y](po)>  =  0. 

Also  (A,  exp  (-f  ad  X)  Y(p,))  =  0  and  so  by  nontriviality  of  A  we  again  get 

0  =  X  a  f  a  [X,  Y]  a  Y-f[X,  Y]  +  £f2[X,[X,  Y]]  +  0(f3) 

(9)  ‘  - . 

=  5*2(i  +  o(t))(x  a  y a [x,  y]  a  [ y,  [x,  y]]),ro. 

This  cannot  hold  in  small  time.  Analogously  it  follows  that  no  *B-  or  •  B *  concatenation 
can  lie  in  the  boundary  of  the  small-time  reachable  set  if  5  #0.  This  proves  the  lemma 
(and  note  that  the  argument  is  valid  in  general  under  assumptions  (A)  and  (B)).  □ 

It  is  now  clear  that  the  surfaces  T*  and  T.  must  be  as  follows: 

T*  =  {p  exp  (j)X)  exp  (s2  Y)  exp  (s3X):  s,  s  0,  small}, 

T*  =  {p  exp  (r,  Y)  exp  (f2X)  exp  (r3  Y):  rf  g  0,  small}. 

T*  and  r*  are  three-dimensional  surfaces  with  common  boundary  C  that  has  precisely 
the  structure  of  the  boundary  of  the  small-time  reachable  set  in  dimension  three.  It  is 
the  union  of  two  two-dimensional  surfaces  made  out  of  XY-  and  YX-trajectories 
respectively,  glued  together  along  the  X-  and  Y-trajectories. 

We  will  now  show  that  T*  and  T*  do  not  intersect  away  from  C,  in  particular  that 
they  enclose  an  open  region  that  will  be  the  interior  of  the  small-time  reachable  set. 

Definition.  We  say  a  point  q  is  an  entry  point  (respectively,  an  exit  point)  of 
a  (closed)  set  S  for  a  vector  field  Z  if  for  some  e  >  0,  Sfl  {q  exp  ( iZ ):  -e  S  t  S  0}  =  {q} 
(respectively,  if  Sfl  {q  exp  ( tZ ):  0giSe}  =  {^}). 

Lemma  2.  For  sufficiently  small  T  the  points  in  T*  are  entry  points  for  the  small-time 
reachable  set  from  p  for  [  Y,  [X,  Y]].  The  points  in  T*  are  exit  points. 

Proof.  If  q  is  an  exit  (entry)  point  for  Reach  (p,  ST)  that  does  not  lie  in 
Reach  ( p,  T),  i.e.,  exit  or  entry  is  not  due  to  the  time  restriction,  then  the  corresponding 
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trajectory  is  extremal  and  the  adjoint  multiplier  satisfies  the  transversality  condition 
(A,  [V'  [X,  Y]](g))S0  ((A,  [  Y,  [X,  Y]](g))sO).  We  claim  that  necessarily 

qeT+iqer*). 

Recall  that  the  second  derivative  of  the  switching  function  is  given  by 
<M0  =  <A(0,[/+gu,[/,g]](*(0)> 

(10)  =Kl-u(/))(A,[X,[X,  Y]](x(f))> 

+i(i  +  u(t))(A,[y,[x,  y]](x(0)>. 

Expressing  [X,  [X,  Y]]  in  terms  of  X,  Y,  [X,  Y],  and  [  Y,  [X,  Y]],  we  get  a  linear 
combination  of  terms  <A,  X),  (A,  Y),  <A,  [X,  Y]),  and  (A,  [  Y,  [X,  Y]]>,  where  the 
coefficient  at  (A,  [  Y,  [X,  Y]j)  is 

5(l-w)5+l(l  +  u)^Min(l,5)>0. 

Suppose  y  is  a  bang-bang  trajectory  with  two  junctions.  Then  the  two  junctions 
determine  a  multiplier  A  up  to  a  positive  constant  multiple.  Normalize  such  that 
||A(0)||2=  1.  Because  y  has  two  junctions  (A,  X),  (A,  Y),  and  (A,  [X,  Y])  vanish  some¬ 
where  on  [0,  T],  T  =  f,  + 12+  h-  For  sufficiently  small  T  these  functions  will  be  bounded 
in  absolute  value  on  [0,  T]  by  any  e>0.  Because  of  (B)  |(A(f),  [  Y,  [X,  Y]](x(t)))|  can 
be  bounded  away  from  zero  on  [0,  7].  By  choosing  e,  i.e.,  T  small  enough, 
<A,  [  Y,  [X,  Y]])  dominates  all  other  terms  in  (10),  that  is,  we  have  in  small  time:  <£ 
has  constant  sign  equal  to  sign  ((A,  [  Y,  [X,  Y]]».  But  (A,  [  Y,  [X,  Y]])>  0  allows  only 
for  XYX-trajectories  and  (A,  [  Y,  [X,  Y]])<0  permits  only  YXY-concatenations.  This 
proves  our  claim. 

We  still  need  to  show  that  points  in  T*  and  T*  in  fact  have  these  optimization 
properties.  Suppose  y  is  a  XYX  trajectory.  Then  the  tangent  space  at  the  endpoint  is 
spanned  by  X,  exp(-t3adX)Y  and  exp  (-t3  ad  X)  exp  (~t2  ad  Y)X.  Note  that 
[  Y,  [X,  Y]]  always  points  to  one  side  of  the  tangent  space  since 

X  a  exp  (-t3  ad  X)  Y  a  exp  (-f3  ad  X)  exp  (~t2  ad  Y)X  a  [  Y,  [X,  Y]] 

=  -t2  ^ X  a  exp  (-/3  ad  X)  Y  a  exp  (-f3  ad  X)  ^  ^ x 

(ID  a[Y,[X,  Y]]) 

=  /2(Xa  Y-f3[X,  Y]+  O(tl)  a  [X,  Y]  +  0(7)  a  [  Y,  [X,  Y]]) 

=  /2(1  +  0(7))(Xa  Ya[X,  Y]  a  [  Y,  [X,  Y]]). 

If  we  write  the  defining  equations  for  T*  and  T*  in  terms  of  canonical  coordinates 
of  the  second  kind,  that  is,  as  products  of  the  flows  of  the  vector  fields  X,  Y,  [X,  Y], 
[Y,  [X,  Y]]  in  the  form 

(12)  p  exp  (x,X)  exp  (x2  Y)  exp  (x3[X,  Y])  exp  (x4[  Y,  [X,  Y]]), 

then  this  implies  that  we  can  think  of  T*  as  the  graph  of  a  function  x4  =  *p(x, ,  x2 ,  x3). 
It  also  follows  from  (12)  that  the  integral  curve  of  [Y,  [X,  Y]]  through  p  and  the 
compact  set  Reach  ( p ,  7)  are  disjoint  for  small  positive  7.  Therefore,  given  7,  there 
exists  a  f  S  7  with  the  following  property.  Any  integral  curve  of  [  Y,  [X,  Y]]  that 
passes  through  a  point  on  T*(  7),  the  set  of  all  trajectories  in  T*  of  total  time  £  7,  does 
not  meet  Reach  (p,  7).  This  implies  that  the  points  on  T*(7)  are  entry  points  for  the 
small-time  reachable  set.  For,  if  qe  T*(7)  is  not  an  entry  point,  then  by  compactness 
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there  exists  an  entry  point  of  Reach  (p,  ST)  of  the  form  q  exp  r[  Y,  [X,  V]].  Since  this 
flow  does  not  meet  Reach  (p,  T)  this  point  must  lie  on  T*  and  this  contradicts  the 
graph  property.  Analogously  the  result  follows  for  T*.  0 

An  easy  computation  shows  that,  if  T*  and  T,  would  intersect  away  from  C,  then 
it  would  have  to  happen  transversally.  This  would  contradict  Lemma  2. 

The  geometric  structure  of  the  small-time  reachable  set  is  now  clear.  It  ;s  the  exact 
analogue  of  Figs.  1  and  2  in  four  dimensions.  Its  boundary  consists  of  the  surfaces  1’* 
and  T*  that  match  up  along  C,  the  set  of  points  reachable  by  a  bang-bang  trajectory 
with  at  most  one  switch.  The  open  region  enclosed  by  T*  and  f%  is  the  interior  of  the 
reachable  set.  A  stratification  of  its  boundary  is  given  by  S„  and  S„,  for  n  =  l,2,  3 
(see  §  3). 

Remark.  This  qualitative  structure  of  the  small-time  reachable  set  for  a  totally 
bang-bang  system  generalizes  to  arbitrary  dimensions  under  the  conditions  of  Krener's 
and  Sussmann's  nonlinear  bang-bang  theorem  [19],  Suppose  that  the  vector  fields  / 
and  ad 'f(g),  i  =  0,  •  •  • ,  n  —  1  are  independent  at  p  and  that  for  i  =  0,  •  •  • ,  n  —  1  there 
exist  smooth  functions  av  and  /3,  with  |/3;(p)|<  1  such  that 

[g,ad'/(g)]=  t  a.j  ad'/(g)  +  0,  ad,'"'/(g). 

i-a 

Then  it  follows  that  for  sufficiently  small-time  T  all  trajectories  that  lie  in  the  boundary 
of  the  reachable  set  from  p  are  bang-bang  with  at  most  n  switchings.  A  stratification 
of  the  boundary  is  given  by  the  strata  S0  =  {p}  and  Sk± ,  k-l,  •  •  • ,  n,  In  particular, 
points  in  S„,+  are  exit  points  of  the  reachable  set  for  (-1)""'  ad ""'/(g),  points  in  S„,_ 
are  entry  points.  Given  the  results  on  the  structure  of  trajectories  in  the  boundary,  this 
is  a  straightforward  generalization  of  the  argument  above.  AH  the  difficult  work  has 
been  carried  out  by  Sussmann  in  [19),  specifically  in  the  proof  of  Lemma  3  there. 

4.2.  The  bang-bang  singular  case:  5<0.  This  case  is  a  nontrivial  extension  of 
Lobry’s  result.  Here  not  all  the  extremal  trajectories  actually  lie  in  the  boundary  of 
the  small-time  reachable  set.  It  is  therefore  not  clear  how  we  should  choose  T*  and 
T*.  We  now  use  the  structure  of  the  small-time  reachable  set  for  the  corresponding 
free  nilpotent  system  as  a  guide.  The  only  reasonable  nilpotent  approximation  to 
choose  is  one  where  all  brackets  of  orders  greater  than  or  equal  to  4  vanish.  Note  that 
f  g>  [/>  g]>  and  [g,  [f  g]]  are  always  independent  in  this  case.  Since  we  want  to  work 
with  a  system  as  simple  as  possible,  we  also  assume  [/  [/  g]]  ^  0.  This  is  an  equality 
relation  in  the  third-order  Lie  jet,  but  in  a  slightly  more  general  setup  (weighted  Lie 
algebra)  this  would  be  a  free  nilpotent  system.  Therefore  we  refer  to  this  system  as 
the  “free”  nilpotent  case.  We  will  first  analyze  a  model  of  this  “free"  nilpotent  case, 
and  then  we  will  show  that  the  general  case  has  the  same  qualitative  behavior. 

4.2.1.  The  reachable  set  in  the  “free”  nilpotent  case.  To  simplify  some  computations 
we  restrict  ourselves  to  the  following  model  X: 

(13)  ,x„=l,  x,  =  u,  .x:  =  .x,,  .x,  -  yx;. 

Note  that  [g,/](x)  =  (d/dx2)  +  .x,(d/dx,),  [g,  [g,/]]  s  d/dx2  and  all  other  brackets  vanish 
identically.  It  is  clear  that  the  qualitative  structure  of  the  reachable  set  from  the  origin 
at  any  time  is  the  same  as  for  the  small-time  reachable  set:  one  is  a  rescaling  of  the 
other.  (If  u  is  a  control  defined  on  [0,  T]  and  ,x  is  the  corresponding  trajectory,  then 
the  time  1  reachable  set  can  be  obtained  from  the  time  T  reachable  set  by  letting 
u(t)'=  u(t/T)  and  x,(0  T'x,(//T)  for  i  =  1, 2,  3.)  To  determine  the  reachable  set  it 

therefore  suffices  to  look  at  time  slices  T  =  constant,  and  without  loss  of  generality  we 
can  assume  T=  1. 
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If  A  =  (A„,A,,A,,A,)r  is  an  adjoint  vector  for  an  extremal  trajectory  x(  • ),  then 
A  |  is  the  switching  function  and 

A,  =  -A:- A,*,,  Ay  =  0,  Aj  =  0, 

and  in  particular  A  ,  =  A  ,u,  i.e..  m  =  0  is  the  only  singular  control.  Note  that,  if  A ,  =  0, 
then  A |  is  a  linear  function  and  the  extremal  trajectory  is  uniquely  determined.  By  a 
theorem  of  Bressan  [2]  this  implies  that  the  reachable  set  is  convex  in  direction  of 
(0,0,0,  l)1  or  equivalently  in  the  direction  of  [g,  [?,/]]  =  ;(X,  [X,  V]];  that  is,  if 
(Pn.Pi.Pi.a)  and  ( p„.  p, ,  p:,  h)  lie  in  the  reachable  set,  then  the  whole  segment 
{(  P<\ .  P\ .  P:.  c):  nScSli)  lies  in  the  reachable  set.  It  is  therefore  clear  what  the  surfaces 
I'*  and  I',  have  to  be:  1'*  consists  of  trajectories  which  are  exit  points  for  [X,  [A’.  K]] 
and  I*  of  those  which  are  entry  points.  Equivalently,  we  can  speak  of  trajectories  that 
maximize/minimize  the  coordinate  x, . 

For  extremal  trajectories  that  give  rise  to  entry/exit  points  for  [X,  [X,  V]],  an 
additional  transversality  condition  was  to  hold.  One  of  the  directions  ±(X,  [X,  y]] 
can  be  separated  from  an  approximating  cone  to  the  reachable  set  at  this  point.  In 
our  case  these  conditions  simply  say  that  A,S0  for  trajectories  that  minimize  x,  and 
A,  %  0  for  those  that  maximize  x, .  In  particular  A3  =  0  for  those  that  do  both  and  these 
trajectories  are  bang-bang  with  at  most  one  switching.  So  again  the  common  boundary 
of  T*  and  I'*  will  be  a  set  C  that  has  the  structure  of  the  boundary  of  the  small-time 
reachable  set  in  dimension  three. 

We  now  determine  Ft .  We  can  assume  A,>0  and  without  loss  of  generality 
normalize  A,  to  1.  Thus,  A,  =  -  u  and  so  A,  is  strictly  convex  and  positive  along  X, 
strictly  concave  and  negative  along  Y.  Singular  controls  satisfy  the  generalized 
Legendre-Clebsch  condition  [13]:  (A,  [g,  [f  g]]>  =  -A 5  <  0.  It  follows  that  the  only 
extremal  trajectories  are  concatenations  of  a  bang  arc,  followed  by  a  singular  arc  and 
another  bang  arc.  We  now  restrict  to  the  time  slice  T  =  1.  Define 

r_„_  :=  {0  exp  (s,X)  exp  (s2Jj  exp(ijX):  s,  SO,  s,  +  s2  +  s5  =  1], 

r_0*  :=  {0  exp  (r,X)  exp  (s2f)  exp  (s,  Y):  s,  S  0,  s,  +  s2  +  s,  =  1], 

Tn, .  :=  {0  exp  it,  Y)  exp  ( t2f)  exp (f3X):  l,  SO ,  I,  +  t2  +  h  =  0. 

I\0+:=  {Oexp  (f,  Y)  exp  (r:/)  exp^T):  r,  SO,  t.  +  tj+r,  =  I}. 

We  will  show  that  these  are  two-dimensional  surfaces  with  boundary  which  match  up 
and  together  form  I'»  with 

3T*  =  {0  exp  (s,X)  exp  (s2  Y):  s,  SO,  s,  +  s:  =  1 } 

U  {0  exp  (/,  Y)  exp  (/:X):  I,  SO,  /,  +  r;  =  1}. 

Lemma  3.  Each  of  the  sets  is  a  two-dimensional  surface  with  boundary.  For 
any  two  of  them  the  images  of  the  open  simplices  are  disjoint.  Furthermore , 

I  „  HI'  -  I'  „  =  {0  exp  (s,X)  exp  (J;/):  s,  S  0,  j,  +  s,  =  1 }, 

1'  fl  I  ..I  -  =  f„-  =  {0  exp  ( s,f )  exp  (s:X):  s,  S  0,  s,  +  s;  =  1 }, 

r  „  n  r.„.  =  r„  =  {o  exp  (.?/>:  os  is  1}  =  r.„,  n  i\„. , 

r  n  r.„.  -  r„»  =  {0  exp  (.?,/)  exp (j2  y )-.  s,  s  0,  s,  +  s2  =  1  (, 

r.„  fl  I'.„,  =  I'.,,  =  {0  exp  (.r,  X)  exp  ( s:Jj:  s,  S  0,  r,  +  r:  =1}. 

Graphically,  these  relations  can  be  illustrated  as  shown  in  Fig.  4. 
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The  proof  of  the  lemma  consists  of  straightforward  computations  that  we  shall 
only  illustrate  in  one  case.  It  is  easy  to  see  that  all  the  maps  are  regular  with  rank  2 
in  the  interior,  and  it  is  clear  how  the  maps  behave  on  the  boundary.  So  the  r*0±  are 
two-dimensional  surfaces  with  boundary.  To  prove  that  the  images  of  the  open  simplex 
under  different  maps  are  disjoint,  we  choose  a  way  that  does  not  use  the  specific  form 
of  the  equations,  but  works  with  a  basis  provided  by  the  vector  fields  /,  g,  [f,  g],  and 
[g,  [/.  g]]-  Th's  also  gives  an  idea  how  the  analogous  argument  in  the  general  case 
runs.  We  rewrite  the  defining  equations  in  terms  of  canonical  coordinates  of  the  second 
kind  as  products  of  the  flows  of  the  vector  fields  f  g,  [/  g],  and  [g,  [ f  g]].  Since  in 
this  case 

(14)  exp  (/+g)  =  exp  ([g,  [/,  g]]/3)  exp  ([/,  g]/2)  exp  (g)  exp  (/), 

we  get,  for  instance,  for  r+o+: 

0  exp  (r,(/+g))  exp  (i2f)  exp(t3(/+g)) 

=  0  exp  ((jfftg,  [/,  g]])  exp  (jfit/  g])  exp  (t,g)  exp  ((/,  +  l2)f)) 

*  exp  (jt3[g,  [/  g]])  exp  Cj'lU,  g])  exp  (r3g)  exp  (f3/> 

=  0  exp  +}/3))[g,[/,g]])  exp  ((s(/l  +  /JI)i+/2/t)[f,g]) 

xexp((f,  +  f3)g)  exp  (/). 

Analogously  we  have  for  r 

0  exp  (s,(/-g))  exp  (s-J)  exp  (s3(/+  g)) 

=  0  exp  ((jsj  -  j;j,  +  |s3  +  1j2j3  -  J,s2s3)[g.  [/,  g]]) 

x  exp  ((-js;  +  ls3  t-(s,  +  s:)sx)[f  g])  exp  ((s,  -s,)g)  exp  (/). 

A  simple  computation  shows  that  the  equations  we  obtain  by  equating  the  coordin¬ 
ates  have  no  positive  solution.  Similarly  this  is  shown  for  all  pairs  of  surfaces.  The 
statements  about  the  intersections  are  then  clear.  □ 

This  shows  that  T,  is  a  two-dimensional  stratified  set  with  its  one-dimensional 
relative  boundary  dl\  made  out  of  bang-bang  trajectories  with  at  most  one  switching. 
Figure  4  gives  a  precise  description  of  the  stratification.  We  now  show  that  the  points 
on  T*  are,  in  fact,  the  points  that  have  the  smallest  x3  coordinate  among  all  points  of 
Reach  (0,  1 )  with  a  fixed  (x„,  x, ,  Xj). 
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Let  us  first  compute  the  tangent  spaces  to  the  surfaces  ri0*.  Note  that  in  each 
case  the'  pullback  of  the  tangent  space  to  the  endpoint  of  the  singular  arc  simply 
consists  of  the  space  spanned  by  the  vectors  g  and  [/  g]  evaluated  there  (remember 
that  we  are  working  in  the  time  slice  T  =  1).  This  implies  that  [X,  [X,  T]]  =  2[g,  [g,/]] 
always  points  to  one  side  of  the  tangent  space.  In  fact, 

exp  (-tad  (/±  g))g  a  exp  (-t  ad  (/±  g))[/,  g]  a  [g,  [/,  g]]  =  l(g  a  [/  g]  a  [g,  [/,  g]]). 

In  the  limit  this  also  holds  for  the  one-dimensional  strata.  Therefore  [g,  [g,/]]  always 
points  to  one  side  of  the  stratified  surface  It  is  easy  to  see  that,  in  fact,  we  can 
think  of  F*  as  the  graph  of  a  piecewise  defined  function  x,  =  i/>(x, ,  x2).  (The  projections 
of  the  images  onto  (x,,  x2)  intersect  only  along  the  projections  of  the  intersections  of 
the  surfaces  r,0l.)  Since  we  have  exhausted  all  possible  extremal  trajectories  that  can 
minimize  the  coordinate  x3  with  P*,  it  is  now  clear  that  given  (x, ,  x2>  x3)e  F*  any 
other  point  (x, ,  x2,  x3)  e  Reach  (0,  1)  with  x,  =  x,  and  x,  =  x2  must  satisfy  x3>  x3.  This 
concludes  the  analysis  of  T*. 

Next  we  will  determine  T*.  Here  we  can  assume  A3  =  -1  and  so  A,  =  u,  i.e.,  the 
switching  function  <f>  is  convex  when  <t>  is  negative  and  concave  when  <t>  is  positive. 
This  clearly  suggests  bang-bang  extremals.  However,  now  the  situation  is  significantly 
different  from  ail  previous  cases:  it  will  turn  out  that  the  times  along  bang  arcs  are  no 
longer  free,  which  in  turn  will  mean  that  we  cannot  a  priori  exclude  bang-bang 
trajectories  with  a  large  number  of  switchings.  In  general,  it  is  a  very  difficult  problem 
to  eliminate  extremal  trajectories  with  a  large  number  of  switchings  (cf.  [4]  or  [16]). 
It  turns  out  that  in  our  approach  we  do  not  even  have  to  address  this  issue. 

Let  us  start  by  showing  that  the  times  along  bang  arcs  can  no  longer  vary  freely. 
Suppose  we  have  a  concatenation  of  a  V-trajectory  followed  by  an  X-arc  with  switchings 
at  the  beginning  and  the  end  (-Xy-).  Call  the  switching  points  p0,  p t,  and  p2  and  let 
s  and  t  be  the  times  along  X  and  Y,  respectively.  Then  p0,  p,,  and  p2  are  conjugate 
points  and  therefore 

0  =  exp  (— s  ad  X)  Y  a  X  a  Y  a  exp  (f  ad  Y)X 

^expH.dXl-lj^^^^pttadyhl^ 

!ij'  =XaVa[X,  n  +  4[*.[/*]]A[y,X]-r[g,[/g]] 

=  (j  -  i)(x  a  y  a  [x,  y]  A  (g,  [/,*]]). 

Hence  s  =  t  and  the  same  is  true  for  a  •  yx--concatenatioo.  Therefore,  so  as  not  to 
violate  the  Maximum  Principle,  and  since  we  do  not  expect  any  degeneracies  in  the 
structure  of  the  reachable  set,  we  restrict  ourselves  to  the  following  two  surfaces: 

f”  =  {0  exp  (s,X)  exp  (s2  Y)  exp  (s3X):  s(  gO,  s,  +  s2  +  s3  =  1,  s,  §  s2,  syS  s2h 

f*  =  {0  exp  (/,  Y)  exp  (r2X)  exp  (r,  y):  i,  SO,  /,  + 12  + 1,  =  1,  r,  §  i2,  /,  s  t2}. 

Our  aim  is  to  build  T*  out  of  trajectories  from  T*  and  T'.  However,  as  they  are  at  the 
moment,  we  still  have  too  many  extremal  trajectories.  The  surfaces  and  T*  have  a 
nontrivial  intersection  y.  To  see  this  let  us  rewrite  the  defining  maps  in  terms  of 
canonical  coordinates  as  follows: 

0  exp  (j|X)  exp  (s2  Y)  exp  (s3X)  =  0  exp  (s,s2(s2  -  s,)[g,  [/,  g]J)  exp  (j,j2[X,  y]) 

x  exp  (s2Y)  exp((s,  +  s,)X), 
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0  exp  (l,  Y)  exp  ( r2X )  exp  (I,  Y)  =  0  exp  ( Nf,(2/,  -  r:  +  /,)[g,  [/.  g]])  exp  (f2/3[X,  V]) 

x  exp  ((f,  +  f,)  Y)  exp  (t2X)- 

If  we  equate  the  coordinates,  it  follows  easily  that  s,  =  f3,  x2  =  <2,  and  s3  =  /, .  It  follows 
that  r  and  1"  also  intersect  along  the  one-dimensional  curve 

y  =  {0  exp  (sX)  exp  {  Y/2)  exp  ((!-s)X):0Sss;j. 

We  need  to  analyze  the  intersection  more  closely.  Let 

q  =  0  exp  (s,X)  exp  (s2T)  exp  (s,X)e  y. 

Then  the  tangent  space  to  F'  at  q  is  spanned  by  (recall  that  x,  =  1  -  s,  -  s2) 

exp  (-jj  ad  X)  exp  (-s2ad  Y)X-X  =  s2([X,  V']  +  (2s,-s:)[g,[/,g]]), 
exp  ( -Si  ad  X)  Y  - X  =  2g  - ss[X,  y]-sl[g,  [/,  g]]. 

The  point  q  also  lies  on  f*  and  a  tangent  vector  to  T*  at  q  is 

t  =  exp  (-/,  ad  Y)X-Y  =  - 2g  +  f,[X,  Y]  -  r;[g,  [/,  g]]. 

In  the  intersection  r3  =  s,  =:  s,  s2  =  j  and  s,  =  t  —  s.  Thus 

T,f”  a  2  =  A(2g  a  [X,  yjAfftt/*]]) 

where 

1  J-i  -U-^)2 

A  =  .0  1  i-2s  =  2s(s-i)S0. 

-1  x  -x2 

Hence  f“  and  f*  intersect  transversally  except  at  the  endpoints  of  y  (x  =  0,  x  =  i). 
Observe  that.the  endpoints  are  characterized  by  the  condition  that  the  conjugate  point 
relation  s  =  t  ( =  5)  holds.  We  need  to  know  which  surface  has  a  larger  x3-coordinate. 
It  follows  from 

Tq f-  A  [g,  [ft/]]  -  — 2g  A  [X,  Y]  A  [ g ,  [/,  g}] 

that  t  and  [g,  [g,/]]  point  to  the  same  side  of  f-  at  q.  Observe  that  x,  =  0  for  points 
on  y.  Since  the  coefficient  of  t  at  g  is  negative,  the  points  of  f  +  for  which  x,  <0  have 
a  larger  x3-coordinate  than  those  points  on  FT  Conversely  for  x,  >  0  the  x3-coordinate 
of  points  on  is  larger.  Therefore  we  define 

r-:=  {0  exp  (x,X)  exp  ( s2  Y )  exp  (x,X):  xf  g0,  x,  +  x2  +  x3=  1,  x2s  |}, 

T*  :=  {0  exp  ( /,  Y)  exp  (/2X)  exp  (t,  Y):  t<  a  0, 1,  +  f2  +  h  =  1,  5}. 

Observe  that  T“  has  the  T-trajectory  in  its  boundary  and  that  the  X-trajectory  lies  in 
the  boundary  of  T*.  Define  r*:=rur+.  It  follows  from  above  that  (X,  [X,  K]J  = 
2[g,  [g,/]]  always  points  to  one  side  of  f”,  and  similarly  this  holds  for  T*.  Since  x,  a  0 
for  points  in  f,  x,  S  0  for  points  in  T-*  and  x,  =  0  exactly  on  the  intersection,  it  follows 
that  T*  is  a  piecewise  defined  function  x,  =  i/>(x, ,  x2). 

It  is  obvious  that  3F*  consists  of  all  trajectories  that  are  bang-bang  with  at  most 
one  switching,  i.e.,  dT*  =aT<.  Graphically,  the  structure  is  illustrated  in  Fig.  5. 

By  directional  convexity  it  is  clear  that  the  whole  set  R  between  T*  and  F*  lies 
in  Reach  (0,  1).  We  need  to  show  that  it  lies  nowhere  else.  The  points  of  f*  and  f" 
that  we  deleted  lie  in  the  interior  of  R.  (We  deleted  those  points  on  1  ,  respectively, 
f”  that  lie  below  f“,  respectively,  f+  in  the  direction  of  [X,  [X,  /]].)  But  this  implies 
that  the  endpoints  of  bang-bang  trajectories  with  more  than  two  switchings  lie  in  the 
interior  of  the  reachable  set  Suppose  we  have  an  extremal  XFXV'-trajectory  with 
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times  s,,  s2 ,  s3,  and  along  the  trajectories.  Then  =  by  the  conjugate  point 
relation,  and  thus  s2  <  s,  +  s3 .  By  the  invariance  of  the  structure  of  the  reachable  set 
it  follows  that  0  exp  (s,X)  exp  (s2  Y)  exp  (s3X)  e  int  Reach  (0,  s,  +  s2  +  s3).  (This  is  a 
point  of  the  type  we  deleted!)  Hence  the  trajectories  that  define  T+  and  T"  are  the 
only  extremal  trajectories  that  can  lie  on  the  boundary  of  the  reachable  set.  This  proves 
R  =  Reach  (0,  1). 

Summary.  For  every  time  t  the  time  reachable  set  is  a  stratified  set  that  is 
topologically  a  sphere.  Its  boundary  consists  of  two  hemispheres  T*(r)  and  T„(r)  whose 
common  relative  boundary  ar*(r)  consists  of  all  points  reachable  in  time  t  by  a 
bang-bang  trajectory  with  at  most  one  switch.  T*(f )  consists  of  all  bang-bang  trajectories 
with  at  most  two  switchings  for  which  the  time  along  the  intermediate  arc  is  greater 
than  or  equal  to  the  sum  of  the  times  of  the  adjacent  arcs.  T,(r)  consists  of  all  trajectories 
that  are  concatenations  of  a  bang  arc,  followed  by  a  singular  arc  and  another  bang 
arc,  where  the  times  along  these  trajectories  are  free  subject  to  OS  time  Sr.  The 
stratification  of  its  boundary  is  given  in  Figs.  4  and  5. 

4.2.2.  The  general  case.  We  now  show  that  the  qualitative  structure  of  the  small¬ 
time  reachable  set  does  not  change  in  the  general  case.  Clearly,  some  of  the  arguments 
will  have  to  be  adjusted;  for  instance,  the  correct  generalization  of  the  arguments  using 
directional  convexity  now  use  the  integral  curves  of  [X,  [X,  V]].  However,  finding  a 
general  version  for  the  explicit  computations  in  the  analysis  of  the  bang-bang  extremal 
trajectories  is  crucial. 

We  first  define  T*.  Recall  that  the  singular  control  is  given  in  feedback  form  as 
u  =  (8  +  l)/(5  -  1)  and  since  5<0  we  have  no  problems  with  u  hitting  the  control 
constraint  |u|=l  in  small  time.  Let  p  =  l/(l-5),  pe(0,l),  and  let  S'~ 
f  +  (8  +  1  )/(5  -  l)g  =  pX  +  (1  -p)  Y,  be  the  singular  vector  field.  Define 

r_,_  :=  { p  exp  (s,X)  exp  ( s:S)  exp  (s3X):  s,  S  0,  small}, 
r_,„  :=  (p  exp  (s,X )  exp  (s2S)  exp  (s,  Y):  s,  SO,  small}, 
r„, .  :=  { p  exp  ( r,  Y)  exp  ( f:S)  exp  ( r3X):  i,  a  0,  small}, 

T.,*  •=  !  p  exp  ( t,  Y)  exp  (r.S)  exp  ( f3  Y):  i,  a  0,  small}, 

r„  :=r.,.ur_,.ur.,-ur,„. 

If  we  replace  /by  S  in  Lemma  3,  then  the  statement  stays  true  verbatim  for  T,,, 
instead  of  r,„, .  (The  computations  are  a  straightforward  though  somewhat  messy 
extension  of  the  computation  in  the  "free"  nilpotent  case  and  we  omit  them.)  So  again 
r„  is  a  stratified  two-dimensional  surface;  its  one-dimensional  relative  boundary  <?I\ 
is  made  out  of  the  bang-bang  trajectories  with  at  most  one  switching. 
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Lemma  4.  For  sufficiently  small  T  the  points  on  are  entry  points  of  Reach  (p,  £  T) 

for  [X,[X,  Y]]. 

Proof.  The  strategy  is  the  same  as  in  the  proof  of  Lemma  2.  We  first  show  that 
the  extremals  on  T*  satisfy  the  necessary  transversality  condition  for  entry  points 
(which  are  not  due  to  the  time  constraint).  Then  we  show  that  T*  actually  is  a  graph 
with  the  coefficient  of  the  flow  of  [X,  [X,  Y]]  as  dependent  variable.  As  in  Lemma  2 
this  suffices  to  prove  our  result. 

If  y  is  any  trajectory  containing  a  singular  arc  then,  for  sufficiently  small  time, 
(A,  [X,  [X,  V]])  will  dominate  (A,  X),  (A,  Y ),  and  (A,  [X,  V]),  in  particular,  it  has 
constant  sign.  Along  the  singular  arc  (A,  [X,  [X,  Y]]>  =  25/(1  -5)  (A,[g,  [X,  Y]])  and 
the  generalized  Legendre-Clebsch  condition  implies  that  (A,  [X,  [X,  V]])  is  positive. 
This  shows  that  points  in  1%  satisfy  the  necessary  transversality  condition.  An  argument 
analogous  to  the  one  made  in  the  proof  of  Lemma  1  shows  that,  in  fact,  any  extremal 
trajectory  for  which  (A,  [X,  [X,  Y]])  is  positive  has  to  be  of  the  form  BSB,  that  is,  we 
have  exhausted  all  possible  candidates.  To  prove  that  indeed  each  point  on  T,  has 
the  entry  property,  we  show  again  that  we  can  think  of  T*  as  the  graph  of  a  piecewise 
defined  function  x,  =  ij/(x0,  x,,  x2),  where  (x0,  x, ,  x2,  x3)  are  canonical  coordinates  of 
the  second  kind,  and  x3  is  the  coefficient  at  the  flow  of  [X,  [X,  Y]].  Let  us  consider, 
for  instance,  T+,_.  It  is  easier  to  compute  the  pullback  of  the  ta.  gent  space  to  the 
endpoint  of  the  singular  arc.  It  is  spanned  by  X,  S,  and  exp  (~f2  ad  S)X.  Note  that 
S  =  pX  +  ( 1  +  p )  Y  and  it  follows  by  induction  that  ad"S(X)  =  aji  +p„Y+  y„[X,  Y] 
with  smooth  functions  a„,  /3„,  y„: 

[S,  ad”'1  S(X)]  =  [pX  +  (l  - p)Y,  flr„_|X  +  pn-\  Y  +y„_,[X,  Y]] 

=  y„-(p[X,[X,  Y]]  +  (l-p)[Y,[X,  Y]])  +f  g  or  [f  g]  terms 
=  p(aX  +  (}Y  +  y[X,  Y]). 

Also  [5,  X]  =  [pX  +  (l  -p)  Y,  X]  =  2L,0)g  +  (p  —  1)[X,  Y].  Therefore 

X  a  S  a  exp  (-r2  ad  S)X  =  (1  -p)2t2(l  +  0(f2))  ■  (/a  g  a  [X,  Y]). 

Now  if  we  take  the  wedge-product  with  [X,  [X,  Y]]  pulled  back  along  X,  t,  this  yields 
X  a  S  a  exp  (-f2  ad  S)X  a  exp  (f3  ad  X)([X,  [X,  Y]]) 

=  (I-p)2f2(l  +  0(T))-(/AgA[/,g]A[X,[X,  Y]]) 

and  there  are  no  problems  with  dominance  since  f2  factors.  Hence  [X,  [X,  Y]]  always 
points  to  one  side  of  r«.,_  in  the  interior.  Analogously  it  follows  for  the  other  surfaces. 
By  continuity  this  also  follows  for  the  one-dimensional  strata.  Straightforward  but 
slightly  more  tedious  computations  show  also  that  the  projections  of  the  relative 
interiors  of  the  sets  Ttl±  onto  (x0,  x,,  x2)-space  are  pairwise  disjoint.  Therefore  is 
a  graph  in  canonical  coordinates.  This  proves  the  lemma.  0 

The  analysis  of  the  bang-bang  extremals  is  more  difficult.  We  start  by  computing 
the  conjugate  point  relations.  Suppose  y  is  a  -XYX- -concatenation  starting  at  p  with 
junctions  at  p,  p, ,  p2,  p,  and  times  s,,  s2,  s3  along  the  respective  trajectories.  Then  we 
have  (the  vector  fields  are  evaluated  at  p,): 

0  =  X  A  Y  A  (eXP  ( X-L-l)  Y  A  (e*P- 1 li)  X 
(16)  =  X  a  Y  a  [ X,  Y]-ij,[X,[X,  Y]]+0(s;)a-[X,  Y]-^2[Y,[X,  Y]]  +  0(i5) 

=  |?(i„J:)(Xa  Ya[X,  Y]  a  [  Y,  [X,  Y]J)|„ 
where  o( s, ,  i2)  =  -s,5  - s2+  0(2). 
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The  equation  c?(s,,s2)  =  0  has  a  unique  solution  Jj(s2)  and  in  general  XYX- 
trajectories  only  satisfy  the  necessary  conditions  of  the  Maximum  Principle  if  s,  £s[(j,). 
Note  that  o-(0,  s,)<0  and  so  this  is  equivalent  to  g-(s,,s2)§0.  (Using  an  argument 
analogous  to  (9)  it  can  be  shown  that  extremal  trajectories  do  indeed  have  switchings 
at  s,  =  sj,  but  we  will  not  need  this.)  Furthermore, 

0  =  x(pJly(P;)A(5E^),(pi) 

)  Y(p,) 

=  x  A  Y  A-[X,  V]  +  ~L,[V.[X,  Y]]  +  ---  A[X,  T]  +  ^j[X,[X,  y]]  +  -  •  • 

=^(s!,j,)(Xa  Ya[x,  y]A[y,[x,  y]])|„ 

where  <r(s2,  s3)  =  -s2- s2S  +  0(  T2). 

Again  the  equation  <?(s2,  s3)  =  0  can  be  solved  by  sj(s2),  and  yxy-concatenations 
only  satisfy  the  Maximum  Principle  if  s2£sl(s2).  Since  <r(s2,0)<0  this  is  equivalent 
to  tr(s2,  s3)^0. 

Therefore  we  define 

f"  =  {p  exp  (s,X)  exp  (s2Y)  exp(s3X):  s,g  0,  small,  s2  is  free, 

?(s,,i2)S0,  <t(s2,  Sj)£0). 

Analogously  we  must  compute  the  conjugate  point  relations  along  a  •  yxy--concatena- 
tion  which  yields 

f+:=  {p  exp  (t,y)  exp  (t2X)  exp  (t2Y):  r,§ 0,  small  t2  is  free, 

j(  t , ,  t2)  S  0 « /,  S  f, ( 1 2)  f  ( t2 ,  /3)  a  Oo  r3  s  /,( t2)} 

where 

Sd,.  h)  =  -t,  -  f25  +  0(T2),  r(t2,  t3)  =  ~t2S  -  h+  0(T2) 

and  f,  and  f3  are  the  solutions  of  r  =  0  and  f  =  0,  respectively,  i  and  T"  are  three- 
dimensional  surfaces  with  relative  boundary  made  up  entirely  of  bang-bang  trajectories 
with  at  most  one  switch. 

Lemma  5.  The  surfaces  f”  and  l  intersect  along  a  two-dimensional  surface  F. 
The  intersection  of  f  with  the  relative  boundaries  dt~  and  T  are  the  following 
one-dimensional  curves: 

y  =  [p  exp  (j,X)  exp  (r2y):  s2  S  0,  small,  s,  =  s,(s2)}, 

y  =  {p  exp  (/,  Y)  exp{t2X):  /:gO,  small,  /,  =  tj(t2)} 

( i.e.,  the  trajectories  corresponding  to  the  conjugate  points ).  Away  from  y  and  y  the  surface 
entirely  lies  in  the  relative  interior  of  f",  respectively ,  f*  and  there  the  intersection  is 
transversal. 

Proof  We  want  to  solve  the  equation 

(17)  p  exp  (s,X)  exp  ( s2Y )  exp  (s,X)  =  p  exp  (t,  Y)  exp  ( t2X )  exp  (f3y). 


/ exp  (sx  ad 

\  Sx 


ad  X)~  1 
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Suppose  a  point  q  in  the  relative  interior  of  I'*  or  f"  lies  on  We  claim  that  (16) 
can  be  solved  in  terms  of  t ,  and  t2  near  q.  This  follows  from  the  Implicit  Function 
Theorem  if  the  Jacobian  with  respect  to  (a,,  s2,  s3,  /,)  is  nonsingular  at  q.  If  we  compute 
these  derivatives  and  pul!  the  vectors  back  along  X  we  get 

exp  (-s3  ad  Y)X  a  Y  a  X  a  exp  (a,  ad  X)Y 

exp  ( -S;  ad  y)  — A  /exp  (s,  ad  X )  -  1 

-s2  /  \  s, 


=  s2s,  ^  A’  a 


=  -s:s,  ■  o(s:,s,)(Xa  Y[X,  y]A[y,[x,  V]])|p 

>*/*  e\p  <  v,  \  )  exp  V  I  • 


But  in  int(f')  s2  and  s3  are  positive  and  also  cr(s2,s3)<  0  since  the  conjugate  point 
relation  does  not  hold.  So  we  can  solve  in  terms  of  f,  and  t2.  This  computation  shows 
also  that  f*  and  intersect  transversaily  in  int(T^)  or  int  (F-). 

Next  we  show  that  points  q  of  this  type  exist.  For  that  we  rewrite  both  sides  of 
(17)  in  terms  of  canonical  coordinates  of  the  second  kind.  A  short  computation  (cf., 
for  instance,  [16])  shows  that 

p  exp  (j|X)  exp  (s3  Y )  exp  ( s2X)~  p  exp  (5S,s:(s,S  +  s2  +  0(S2))[  Y,  [X,  y]]) 


•  exp  (s,s2(l  +  0(S))[X,  y]) 

•  exp  ((s2+  0(S3))  Y)  exp  ((s,  +  s3+  0(S3))X), 
p  exp  (/,  y)  exp  (f3X)  exp  (t3  Y)  =  p  exp  (jt2t3(2r,  +  r3+  t2S  +  0(T2))[  Y ,  [X,  y]]) 


■cxpU2C3(l+0{T)KX,  V]) 

■  exp  ((t,  + 13+  0(T3))  y)  exp  ((t2+  0(T3))X) 

where  0(S1')  or  0(Tk)  stand  for  terms  of  order  greater  than  or  equal  to  k  in  the  total 
time,  S  =  s,  +  s2  + s3,  T=  t,+ 12  +  t},  and  S  is  evaluated  at  p.  Equating  coefficients  we  get 

(i)  s,  +  s3  +  0(S3)  =  r2  +  0(7'3), 

(ii)  S'+0(S3)  =  r,  +  r3+0(T3), 

(18) 

(iii)  s,s2(l  +  0(S))  =  t2r3{l  +  0(r)), 

(iv)  s,s2(s,S  +  j2+  0(S2))  =  l2t3(2t,  +  r3+  f25  +  0(  T2)). 


If  we  assume  that  all  switching  times  are  comparable,  i.e.,  of  order  T,  then  (18(i),  (ii)), 
and 

(iv')  s,  5  +  s2  +  CHS2)  =  2t,  +  r3  +  r2«  +  0(  T2) 
can  easily  be  solved  for  s  in  terms  of  t  modulo  higher-order  terms: 

s,  =  /2  +  -  ;,  +  0(  T2), 

0 

(19)  r3  =  f,  +  f3  +  0(r3), 

**--7 '.  +  <?(  7*). 

o 


With  these  times  the  conjugate  point  relations  cannot  hold  since 
(20)  <?(j2,  j,)  =  -j2-j35  +  0(rJ)  =  -(3  +  0(r2) 
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is  negative.  So  the  corresponding  point  q  lies  in  fact  in  the  relative  interior  and  therefore 
it  is  possible  to  solve  for  t}  in  terms  of  /,  and  i2: 

(21)  t2  =  -t,  -  Sr2+  0(T2). 

This  gives  a  solution  to  (18).  Note  that 

(22)  h  =  pT+  0{  T-)  =  — ^— +  0(  T2). 

I  —  o 

As  long  as  (t,,t2,h)  are  bounded  away  from  the  boundary  of  the  simplex 
/,  +  /,  + 13  s  T,  the  times  are  comparable,  these  computations  are  justified,  and  we  get 
a  two-dimensional  intersection  that  we  can  parametrize  by  t,  and  (2.  The  problem  is 
whether  it  extends  all  the  way  to  the  boundary.  But  the  equations  (19)  and  (21)  are 
well  defined  for  f,->0  (in  a  time-slice  1,  +  i2+  i2  =  T  it  follows  that  t}-* -Si2+  0(t\), 
i.e.,  to  a  limit  of  order  T.  By  (20)  this  implies  that  the  two-dimensional  surface  defined 
by  these  functions  of  (r,,  t2)  stays  away  from  the  conjugate  point  condition  d{s2 ,  s3)  =  0. 
Hence  the  implicit  function  theorem  is  still  applicable.)  Therefore  f  extends  all  the 
way  out  to  /,  =0,  i.e.,  to  the  XY  boundary  surface. 

A  precise  characterization  of  P IT r~fT{p  exp  (s,X)  exp  (s2  y):  s,  SO,  small)  is 
possible.  Clearly  these  are  points  such  that  tt=0,  t2  =  s, ,  1 3  =  s2,  and  0  =  s3.  Since 
(s, ,  s2, 0)e  dom  P  we  have  <r(s,,  s2)  =  0.  and  since  (0,  s, ,  s2)e  dom  P  we  have 
f(j,,j2)S0.  But  in  this  case  <x(s, ,  s2)  =  f(j,, s2)  (cf.  (16)  and  the  analogous  formula 
for  f).  Therefore  g (sx,  s2)  =0,  i.e.,  s,  =  T,(s2),  the  conjugate  point  relation. 

This  proves  that  t~  fl  f  *  extends  all  the  way  out  to  the  XV-boundary  surface  and 
that  the  intersection  with  the  Xy-surface  is  the  one-dimensional  curve  y  consisting  of 
the  conjugate  points. 

Analogously  we  can  show  that  (17)  can  also  be  solved  in  terms  of  s,  and  s2  in 
int  (f-).  Using  these  formulas  we  can  show  that  f“n  f+  extends  all  the  way  up  to  the 
yX-boundary  surface  and  that  the  intersection  of  fl  f*  with  the  yX-surface  consists 
of  the  curve  y.  0 

Note  that  in  a  time-slice  f ,  +  t2  +  t}=  T  the  qualitative  geometric  structure  of  U  f* 
is  exactly  as  in  the  free  nilpotent  case.  Only  the  condition  t2  =  T/2  is  replaced  by 
rz  =  (i/(l  —  5))7~  (modulo  higher  terms)  which  shifts  f  away  from  the  center.  This  is 
illustrated  in  Fig.  6. 

The  surface  f  bisects  T*  and  and  only  one  of  the  two  components  has  the  y-, 
respectively,  X-trajectory  in  its  boundary.  We  define  f  and  P'  to  be  these  components 


Fig.  6 
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and  let  F*  =  I'"'  U  r~.  It  is  then  clear  that  T*  is  a  three-dimensional  stratified  surface 
whose  relative  boundary  consists  of  all  bang-bang  trajectories  with  at  most  one 
switching,  i.e.,  5r*  =  5r+. 

Lemma  6.  The  points  in  T*  are  exit  points  of  the  small-time  reachable  set  for 
[X,[X,  Y]]. 

Proof.  It  is  easy  to  see  (cf.  (10))  that,  for  sufficiently  small  time,  all  extremals  on 
F~  or  F*  satisfy  the  necessary  transversality  condition  (A,[X,  [X,  y]])S0. 

We  show  first  that  the  points  that  we  deleted  from  f”  and  F*  are  not  exit  points 
(see  Fig.  7).  Let 

q-p  exp  (s,X)  exp  (s2  Y )  exp  (ssX)  =  p  exp  (/,  Y)  exp  (t2X)  exp  ( t3  Y) 

be  a  point  in  the  relative  interior  of  f.  f-  and  i  intersect  transversally.  It  follows  as 
in  the  proof  of  Lemma  2  (cf.  (11))  that  the  XYX-  and  yxy-surfaces  are  graphs 
xt  =  i/i(x,  ,  x2,  x3)  in  canonical  coordinates  of  the  second  kind  with  x4  the  coefficient 
at  the  flow  of  [X,  [X,  Y]].  This  inherits  on  f  ”  and  f +.  To  prove  that  the  parts  of  f  " 
(respectively,  Ji  )  that  we  delete  are  not  exit  points,  it  suffices  to  show  that  these  parts 
lie  below  f+  (respectively,  F~)  in  direction  of  [X,  [X,  V]]. 


X 


The  tangent  space  to  f-  at  q  is  spanned  by  X,  exp  (-s3  ad  X)  Y  and 
exp  ( — 5j  ad  X)(exp  ((- s2  ad  V)-  l)/-s2)X.  To  show  that  the  part  of  P*"  that  we  deleted 
lies  below  f"  near  q  it  suffices  to  show  that  (X,  [X,  y]]  and  a  tangent  vector  l  to  t* 
that  is  oriented  toward  the  sector  of  T*  that  we  deleted  point  to  opposite  sides  of  T,r_. 
We  get  such  a  vector  t  if  we  lengthe.'  the  time  along  the  last  Y  leg.  (We  delete  the 
piece  that  contains  in  its  boundary  the  trajectories  corresponding  to  the  conjugate 
point  relation  t,=  t}(t2).) 

Instead  of  computing  at  q  we  pull  back  all  vectors  along  X,  s3  and  get 
exp  (+ s,  ad  X)(  T,f")  a  exp  (+s,  ad  X)[X,  [X,  y]] 

=  |aa  y^i!ip(~^d  ^~'|xa  exp  (s3  ad  X)[X,  [X,  y]]) 


=  — (s  +  of t))(x a  yA[x,  y]A[y,[x,  y]])|„.,..p(.,x, 
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exp  (s3  ad  X)(T,f  )  a  exp  (s3  ad  X)  Y 

exp  (-5:  ad  V)  -  1\  / exp  (s3  ad  X)- 1 

)  \  h 

=  S,a(i2.  5,)(X  A  Y  A  [X,  Y)  a  [  Y,  [X,  y]])|„. 

But  q  is  a  point  in  F,  and  f  lies  entirely  in  the  relative  interior  of  f~  except  for  the 
obvious  boundary  curves  y  and  y.  In  particular  (cf.  also  the  proof  of  Lemma  5)  the 
conjugate  point  relation  s3  =  s3{s2)  does  not  hold,  or  equivalently,  <r(s2,  *3)  <0.  So 
these  wedge-products  have  opposite  signs,  which  proves  our  claim.  This  also  implies 
that  the  portion  of  f  “  that  we  delete  lies  below  t*,  and  since  there  is  no  other  intersection 
this  holds  for  all  the  points  we  deleted. 

The  stratified  sets  T*  and  T*  enclose  a  region  R  that  lies  in  the  small-time  reachable 
set.  In  particular,  the  portions  of  and  I'+  that  we  deleted  therefore  fie  in  the  interior 
of  the  reachable  set.  Since  these  pieces  contain  the  trajectories  corresponding  to  the 
conjugate  points  t3  =  f3(r2)  and  r3  =  3J(r2),  it  follows  that  no  bang-bang  trajectory  with 
more  than  two  switchings  lies  in  the  boundary  of  the  small-time  reachable  set.  Hence 
the  points  in  T*  are  the  only  possible  exit  points  of  the  small-time  reachable  set  for 
[X,  [X,  T]].  It  follows  from  the  construction  of  T~  and  T*  that  T*  is  also  a  graph. 
Again,  the  projections  onto  (x,  ,x2,x3)-space  are  disjoint.  Therefore  it  follows  as  in 
Lemma  2  that  the  points  on  T*  have  the  exit  property  for  sufficiently  small  time.  D 
Finally,  T*  and  T*  do  not  intersect  in  their  relative  interiors.  It  is  now  clear  how 
the  small-time  reachable  set  looks:  It  is  the  set  of  points  enclosed  by  the  two  three- 
dimensional  stratified  surfaces  I'*  and  T* .  T*  consists  of  bang-bang  trajectories  with 
at  most  two  switchings  such  that  modulo  higher-order  terms 

(23)  /,  +  5f2+r3si0 

if  t, ,  t2,  and  t3  are  the  consecutive  times  along  a  YXY  arc  and 

(24)  i,5  +  s2  +  s35g0 

if  s, ,  s2,  s3  are  consecutive  times  along  XYX.  T,  consists  of  all  concatenations  of  a 
bang  arc,  followed  by  a  singular  arc  and  another  bang  arc  where  the  time  along  the 
trajectories  is  free.  T*  and  T*  have  a  common  relative  boundary  C  consisting  of  all 
trajectories  that  are  bang-bang  with  at  most  one  switching.  For  sufficiently  small-time 
T  a  time-slice  of  the  reachable  set  has  exactly  the  same  qualitative  geometric  structure 
as  for  the  free  nilpotent  system  (13).  Furthermore,  if  S(-)  is  an  integral  curve  of 
[X,  [X,  T]]  such  that  S(t,)  and  5(r2),  r,  <  t2,  lie  in  the  small-time  reachable  set,  then 
so  does  the  whole  curve  S(t),  t,£t£t2.  The  points  on  T*  are  entry  points  for 
[X,  [X,  Y] ];  the  points  on  T*  are  exit  points. 

Remark  We  emphasize  that  the  result  is  not  what  might  be  expected  intuitively. 
From  dimensionality  we  could  conjecture  the  occurrence  of  bang-bang  trajectories 
with  two  switchings,  respectively,  BSB  trajectories  in  the  boundary  of  the  small-time 
reachable  set.  Also,  this  is  essentially  what  was  partially  known  from  earlier  results. 
However,  we  see  no  simple  reasoning  that  could  explain  why,  in  fact,  some  of  these 
bang-bang  trajectories  with  two  switchings  are  not  a  part  of  the  boundary.  This  is  only 
revealed  by  our  analysis. 

4.3.  Time-optimal  control  in  dimension  three.  Our  results  have  immediate  implica¬ 
tions  on  time-optimal  control  in  dimension  three.  Suppose  the  triples  (g,  [/,  g],  [/+  g, 
[/.  g]])  ancl  (?.  [/.  g]>  [/ “  g.  [/.  g]])  consist  of  independent  vectors  at  a  point  p  in  R3. 
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Equivalently,  suppose  that  the  constant  controls  u^+1  and  u  =  -  I  are  not  singular. 
If  we  augment  the  three-dimensional  system  2  to  a  four-dimensional  system  £  by 
introducing  time  as  a  coordinate,  .x„=  1,  x„(0)  =  0,  i.e.. 


then  if  a  ^-trajectory  ,x(-):(0,  T]->R'  steering  p  to  q  is  time-optimal,  the  augmented 
trajectory  x  lies  in  the  boundary  of  the  reachable  set  from  p.  The  augmented  system 
1  satisfies  ,  ur  assumptions  (A)  and  (B),  and  therefore  time-optimal  trajectories  are 
bang-bang  with  at  most  two  switchings  or  concatenations  of  a  bang-arc,  followed  by 
a  singular  arc  and  one  more  bang  arc.  Under  additional  assumptions  this  result  was 
obtained  earlier  by  Bressan  [4],  who  studied  only  trajectories  emanating  from  an 
equilibrium  point  of  /  and  by  Sussmann  [22]  and  Schattler  [17]  who  both  assumed 
in  addition  also  that  f  g  and  [/  g]  were  independent.  Our  analysis  shows  that  the 
vector  field  /  is  irrelevant  and  we  do  not  have  to  make  any  assumptions  about  it.  Our 
results  are  also  more  precise  in  the  sense  that  we  can  exclude  the  optimality  of  those 
bang-bang  trajectories  with  two  switchings  that  violate  (23)  (respectively,  (24))  in  the 
bang-bang  singular  case.  We  summarize  in  the  following  corollary. 

Corollary.  Suppose  the  vector  fields  g,  [f  g]  and  [/  +  g,  [/  g]]  are  independent 
near  a  reference  point  p  &  R5.  Write 

[ f~g .  [/«]]  =  ag  +  b[f,  g]  +  c[/+  g,  [/  g]] 

and  assume  that  c  does  not  vanish.  Then  we  have  in  small  time : 

(i)  If  c>0,  then  time-optimal  trajectories  are  bang-bang  with  at  most  2  switches. 

(ii)  If  c  <  0,  then  time-optimal  trajectories  are  bang-bang  with  at  most  two  switching j 
or  are  concatenations  of  a  bang  arc,  a  singular  arc,  and  another  bang  arc.  Time-optimal 
XYX  (.respectively,  YXY)  concatenations  satisfy  modulo  higher-order  terms 

c(s,  +s3)  +  J2gO  ( resp.,  r,  +  /j  +  cr2S0) 

where  s , ,  j2,  s,  ( respectively ,  t, ,  t2,  t3)  are  the  consecutive  times  along  the  bang  arcs. 

5.  A  brief  outlook  to  higher  dimensions.  We  have  outlined  a  general  method  to 
determine  the  structure  of  the  small-time  reachable  sets  and  proved  its  effectiveness 
in  nondegenerate  cases  in  small  dimensions.  One  of  the  difficulties  that  will  become 
more  and  more  prominent  in  higher  dimensions  is  that  the  necessary  conditions  of  the 
Maximum  Principle  will  not  restrict  the  class  of  extremal  trajectories  sufficiently  enough 
to  give  the  candidates  for  f*  and  T*. 

Under  assumptions  (A)  and  (B)  in  dimension  four,  we  could  overcome  this  problem 
by  taking  a  corresponding  “free"  nilpotent  system  of  the  same  dimension  as  a  guide. 
We  do  not  expect  this  to  happen  in  general.  In  fact,  for  the  five-dimensional  system 
£,  where  we  assume  that  /,  g,  [fig],  [f[fig]],  and  [g,  [fig]]  are  independent,  the 
small-time  reachable  set  has  extremal  trajectories  in  its  boundary  that  do  not  appear 
in  the  analogous  five-dimensional  free  nilpotent  system.  The  reason  for  this  lies  in  a 
-  qualitatively  different  behavior  of  the  singular  controls,  specifically,  in  the  fact  that 
singular  controls  can  now  hit  the  control  constraint  |u|  =  1  and  may  have  to  be 
terminated.  Nevertheless,  the  fret  nilpotent  system  contains  most  of  the  information 
about  the  small-time  reachable  set,  though  it  does  not  characterize  it  completely.  To 
be  more  specific,  we  will  briefly  describe  (without  proofs)  the  structure  of  the  reachable 
set  for  the  free  nilpotent  system  in  dimension  five  and  how  the  general  case  differs 
from  it. 
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We  take  as  our  model: 

x0=l,  x,  =  u,  x2  =  xu  x3  =  x2,  x4  =  jxf. 

It  is  no  problem  whatsoever  to  carry  out  the  analysis  within  our  technique  as  in  the 
construction  in  §  4.2.1.  Now  the  reachable  set  is  convex  in  direction  of  (0,  0,  0,  0,  1  )r  = 

[g,  [g,/]]  and  T*,  respectively,  T*  will  consist  of  those  trajectories  that  are  exit, 
respectively,  entry  points. 

It  follows  from  the  generalized  Legendre-Clebsch  condition  that  r*  contains 
concatenations  with  singular  arcs,  whereas  T*  will  consist  of  bang-bang  trajectories  ! 

only.  Singular  controls  are  constant,  but  now  they  can  take  on  any  value  in  [-1,  1].  i 

Let  T,  =  r.„.  U  T-,,  U  U  ,  where 

i 

r_u_:={Oexp(s,X)exp(s2(/+ug))exp(s1.V):  sf  g  0,  s,  +  +  s3  =  1,  «e[-l,  1]},  1 

etc.  (By  the  invariance  property  of  the  reachable  set  we  can  restrict  to  the  time-slice 
T  =  1.)  The  points  on  T*  are  precisely  the  ones  that  minimize  the  coordinate  x4. 

For  a  fixed  value  u0  of  the  singular  control,  -1  <  u0<  +1,  the  qualitative  structure 
of  r*.Uo  =  T,  restricted  to  values  u  =  u0  is  precisely  as  in  4.2.2,  Fig.  4  (see  Fig.  8). 

For  u0  =  +1,  T_u_  f  u  =  1  reduces  to  r_4_  and  all  other  strata  become  trivial  whereas  I 

for  u0  =  - 1 ,  r+u,.  f  u  =  ~  1  =  r+_+  and  the  remaining  strata  are  trivial.  For  each  of  these 
two-dimensional  surfaces  (u0  fixed)  the  relative  boundary  consists  of  all  bang-bang 
trajectories  with  at  most  one  switching.  The  surfaces  r,iUll  themselves  interpolate 
between  T*.,.  for  ua=  -1  and  T_+_  for  u0-  1.  Topologically  f*  is  a  stratified  sphere  I 


r. 


Fig.  9 
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with  ar*  =  r.,.ur,.r,  i.e.,  all  bang-bang  trajectories  with  at  most  two  switchings 
(see  Fig.  9). 

The  surface  T*  consists  of  bang-bang  trajectories  analogous  to  the  bang-bang 
singular  case  in  dimension  four.  Now 


I”  =  {0  exp  (x,X)  exp  (s:  Y)  exp  (x,X)  exp  (s4  Y):  s,  S  0,  s,  +  s;  +  s,  +  s,=  1, 

x,  S  jj,  s4s  s.-conjugaie  point  relations}. 

f-  =  {0  exp  (r,  V)  exp  (f,X )  exp  (t3  Y)  exp  (t4X):  i,g0,  t,  + 12+  t3  + 14  =  1, 

l,  £  <j,  r4S  t:-conjugate  point  relations}. 


f  and  f  *  intersect  in  a  two-dimensional  surface  F,  which  consists  of  those  trajectories 
for  which 

(s,  +  x,)*'  -  (s,  +  s})  +  2s2s,  =  0, 


respectively. 

The  intersection  is  transversal  except  at  those  points  that  lie  on  the  relative 
boundary  of  f”  or  FT  These  points  are  again  characterized  by  the  conjugate  point 
relation 

fnr_„  =  {Oexp  (t,  V)  exp  ( t2X )  exp  (t3  Y )  exp  (r4X):  t,  =  0,  t4=  t,}, 
fnr,.*  =  {0  exp  (f,  Y)  exp  (t;X)  exp  (t3  Y)  exp  (f4X):  r,  =  t3,  f4  =  0}. 

We  define  F~  (respectively,  T*)  as  the  component  of  f"  (f+)  containing  the 
VX-curve  =  (0  exp  (s2y)  exp  (s3X):  jj  a  0,  s2  +  ^j =  1}  (respeaively,  the  XV-curve)  in 
its  boundary.  Then  r*:=rur  consists  precisely  of  those  points  that  maximize  x4 
on  the  reachable  set.  Note  that  topologically  T*  also  is  a  stratified  sphere  with 
ar*  =  r.*-Ur.-»,  the  set  of  all  bang-bang  trajectories  with  at  most  two  switchings 
(see  Fig.  10). 

The  key  fact  here  is  that  it  is  still  obvious  that  dT*  and  dr„  match  up.  They  are 
identical.  It  is  therefore  clear  that  Reach  (0,  I)  is  the  set  of  all  points  that  lie  between 
T*  and  T*. 

It  is  precisely  this  simple  reasoning  that  breaks  down  in  the  general  case.  The 
cause  for  this  lies  in  the  structure  of  the  singular  controls.  The  analysis  of  the  bang-bang 
trajectories  carries  over  to  the  general  case  with  only  one  minor  change  in  the  structure. 
Whereas  in  the  free  nilpotent  system  the  two  curves  F  fl  T*—  and  f  fl  T_4_  both  have 
points  corresponding  to  the  X-  and  K-trajectories  as  endpoints,  this  need  no  longer 
be  true:  ffir,.-.,  is  a  curve  starting  at  0exp(l  ■  Y)  but  which  in  general  no  longer 
ends  in  0exp(l  ■  X)  but  rather  on  a  point  in  the  XV-curve  (respectively,  YX-curve). 
This  distortion  is  due  to  the  presence  of  fourth-order  brackets.  One  possible  case  is 
depicted  in  Fig.  11. 

Still  the  relative  boundary  of  T*  consists  of  all  bang-bang  trajectories  with  at  most 
two  switchings.  The  structure  breaks  down  in  the  analysis  of  the  singular  surface  T, 
for  u  near  ±1.  The  reason  is  that  in  the  presence  of  fourth-order  brackets  the  singular 
controls  are  no  longet  constant,  and  thus  the  analogue  of  for  u0=-l  does  not 
reduce  to  T.,..,,  i.e.,  to  bang-bang  trajectories  with  two  switchings.  For  instance,  it 
may  not  be  at  all  possible  to  sttrt  a  singular  control  with  u0=  -1.  This  is  the  case  if 
u  <0  at  u0=  -1,  which  happens  under  generic  assumptions  on  fourth-order  brackets. 
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Fig.  10 


to  the  right  of  r  :  XYXY 
to  the  left  of  f  :  YXYX 


For  the  same  reason,  singular  controls  with  u0  close  to  ±1  may  have  to  be  terminated 
when  they  become  one  in  absolute  value.  If  the  singular  control  becomes  saturated 
(i.e.,  hits  the  constraint  and  cannot  be  continued)  then  this  determines  the  subsequent 
structure  of  the  trajectory  and  it  is  easy  to  see  that  concatenations  such  as  BSBB  or 
BBSB,  which  are  not  present  in  the  free  nilpotent  system,  come  into  play.  Therefore 
T*  has  trajectories  in  its  relative  boundary  that  contain  singular  arcs.  The  main  challenge 
in  applying  our  technique  to  higher  dimensions  seems  to  be  finding  a  way  to  decide 
whether  structurally  different  trajectories,  such  as  a  bang-bang  trajectory,  and  a  concate¬ 
nation  that  contains  a  singular  arc  steer  a  system  to  the  same  point.  Once  <jr*  and  dT, 
can  be  identified,  it  is  clear  that  the  set  they  enclose  is  the  small-time  reachable  set. 

Note,  however,  that  this  structural  instability  only  happens  near  r„._,  and  T,.*,. 
The  structure  of  most  of  the  trajectories  in  the  boundary  is  still  the  same  as  in  the  free 
nilpotent  systems.  And  it  is  intuitively  clear  that  the  structure  of  the  exceptional 
trajectories  will  come  up  in  a  higher-dimensional  nilpotent  system.  Therefore,  in  our 
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view,  the  study  of  the  structure  of  the  reachable  sets  for  nilpotent  systems  will  be  the 
key  to  the  general  problem. 

6.  Summary.  We  have  described  an  approach  to  determining  the  qualitative  struc¬ 
ture  of  the  small-time  reachable  set  in  a  nondegenerate  situation.  It  is  a  ncmtrivial 
extension  of  a  construction  done  by  Lobry  in  dimension  three.  In  dimension  fo^we 
succeed  completely  in  determining  the  small-time  reachable  set.  For  higher  dimensions 
obstacles  still  have  to  be  overcome.  However,  they  do  not  lie  in  the  general  structure 
of  our  approach,  but  in  the  fact  that  too  little  is  known  about  the  structure  of  extremal 
trajectories  in  higher  dimensions.  For  instance,  in  the  five-dimensional  case,  what  is 
the  precise  structure  of  extremal  trajectories  that  contain  a  saturated  singular  arc?  For 
dimensions  six  and  beyond,  the  crucial  new  ingredient  appears  to  be  the  incorporation 
of  chattering  arcs,  another  structure  of  extremal  trajectories  about  which  little  is  still 
known. 
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1.  INTRODUCTION 

The  close  connection  between  Markov  processes,  diffusions  and  parabolic  partial 
differential  equations  is  of  course  well-known.  In  this  paper  we  shall  describe  the 
beginning  of  a  new  theory  which  links  reciprocal  processes,  second  order  diffusions  and 
the  partial  differential  equations  of  fluid  mechanics,  i.e.,  the  continuity,  Euler  and 
energy  balance  equations. 

2.  RECIPROCAL  PROCESSES 

In  the  early  thirties  E.  Schrodinger  [2,3]  introduced  a  new  class  of  stochastic  processes 
in  attempt  to  formalize  the  stochastic  aspects  of  Quantum  Mechanics.  This  concept  was 
formalized  by  S.  Bernstein  [1]  in  an  address  to  the  International  Congress  of 
Mathematicians  in  Zurich  in  1932.  Bernstein  defined  a  reciprocal  process  x(t)  as  one 
where  conditioned  on  the  values  x(tg)  and  x(t^)  of  the  process  at  two  times  tg  <  tp  the 

process  exterior  to  [tg,t  J  is  independent  of  the  process  interior  to  [tg,tjj.  This  is  readily 

seen  to  be  a  generalization  of  the  Markov  property,  i.e.,  conditioned  on  single  time  tg 

the  process  before  tg  is  independent  of  the  process  after  tg.  Hence  every  Markov  process 

is  reciprocal  but  the  converse  is  not  true. 

The  reciprocal  property  is  the  specialization  to  one  dimension  of  P.  Levy's  definition 
of  a  Markov  random  field  [20].  There  are  two  other  ways  of  viewing  the  reciprocal 
property.  Suppose  x(t)  is  a  random  process  taking  values  in  lRn  and  defined  for  t  e  [0,T]. 
We  define  another  process  X(tg,t^)  =  (x(tg),x(tj))  taking  values  in  iR^n.  We  view  this 

process  as  parametrized  by  pairs  (tg.tj)  where  tg  <  t^  or  equivalently  by 

subintervals  (tg,tj).  Subintervals  are  partially  ordered  by  inclusion.  It  is  easy  to  see 
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that  the  original  process  x(t)  is  reciprocal  iff  the  two  time  process  X(tg,tj)  is  Markov 

:rj  ■  \ 

relative  to  this  partial  ordering. 

Alternatively  we  can  view  a  reciprocal  process  as  being  conditionally  Markov  in  the 
following  sense.  Given  any  tg  e  [0,T]  and  x^  e  IRn,  we  define  a  conditioned  process 

x(t|tQ,x^)  consisting  of  all  sample  paths  of  x(t)  satisfying  x(tg)  =  x^*  with  the 

conditional  probabilty  measure.  The  process  x(t)  is  said  to  be  conditionally  Markov  if 
every  x(t|tg,x^)  is  Markov  for  t  e  [0,tg]  and  is  also  Markov  for  t  e  [tg,T],  (It  need  not 

be  Markov  on  [0,T] . )  It  is  straightforward  to  note  that  a  process  x(t)  is  reciprocal  iff  it 
is  conditionally  Markov. 

To  essentially  specify  a  stochastic  process  one  must  describe  all  finite  dimensional 
distributions  of  the  process,  e.g.,  give  the  probability  distribution  of 
x(tg),x(tj),  .  .  .  ,x(tn)  where  0  <  t^  <  .  .  .  <  tn  <  T.  One  reason  that  Markov  processes 

are  so  well-studied  is  that  they  are  completely  determined  by  only  two  functions.  The 
first  is  /?q(x®),  the  probability  density  of  x(0).  (Throughout  we  assume  that  probability 

densities  exist  although  the  discussion  can  be  easily  extended  using  probability 
distributions.)  The  second  p(s,x;t,y)  is  the  Markov  transition  density  of  x(t)  =  y  given 
that  x(s)  =  x.  By  Bayes'  formula  the  probability  density  of  x(t^)  =  x1,  ...  ,  x(tfi)  =  xn 

where  0  <  tj  <  t2  <  .  .  .  <  tfl  <  T  is  given  by 

KtpX1,  .  .  .  ,  tn,xn)  =  J  p0{x°)  p(0,x°;t1  jX1 )  .  .  .  p(tn_j,xn_1;tn,xn)  dx° 

A  function  p(s,x;t,y)  is  a  Markov  transition  density  iff  it  satisfies  the  well-known 
Chapman— Kolmogorov  relations,  i.e. , 

/  p(s,x;t,y)  dy  =  1 


and 


p(s,x;u,z)  =  f  p(s,x;t,y)  p(t,y;u,z)  dy 

j  Kn 

where  0<s<t<u<T. 

There  is  a  similar  development  for  reciprocal  process  du..  to  Schrodinger  [2]  and 
Jamison  [5|.  A  reciprocal  process  x(t)  is  completely  determined  by  the  joint  density 
Pq  rp(x  )  of  the  end  points  x(0)  and  x(T)  and  a  reciprocal  transition  density 

qfs,x:l.,y;u,z).  'fhe  latter  is  the  probability  density  of  x(t)  =  y  given  that  x(s)  =  x  and 
x(uj  —  z  where  0  <s  <  t  <  u  <  T.  T  he  finite  dimensional  densities  of  x(t)  are  then  given 
by 
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/j(tpX  , .  .  .  ,  tn)x  )  —  J Pq  ^(x  ix  )  q(tQ,x  ;tpX  ;T,x  ) 

q(t1,x1;t2,x2;T,xT)  .  .  q(ltn_1,xn_1;tn>xn;T,xT)  dx°dxT 

To  be  a  reciprocal  transition  density,  q(s,x;t,y;u,z)  must  satisfy  the 
Schrodinger-Jamison  relations 

f  q(s,x;t,y;u,z)  dy  =  1 


and 


q(r,w;s,x;u,z)  q(s,x;t,y;u,z)  =  q(r,w;t,y;u,z)  q(r,w;s,x;t,y) 


where  0<r<s<t<u<T  and  w,x,y,z  £  IRn. 

Suppose  x(t)  is  a  reciprocal  process  and  X(tQ,t^)  is  the  associated  two  time  process 

which  is  Markov  relative  to  the  inclusion  partial  ordering.  One  can  show  that  the 
Chapman— Kolmogorov  relations  for  the  Markov  transition  density  of  X(tQ,t1)  are 

equivalent  to  the  Schrodinger-Jamison  relations  for  the  reciprocal  transition  density  of 
x(t). 

Schrodinger  realized  that  there  is  Bayesian  way  of  constructing  a  reciprocal  transition 
density  q  from  a  Markov  transition  density  p, 


q(s,x;t,y;u,z) 


p(s,x;t,y)  p(t.y;u,z) 
P(s,x;t,yJ 


Of  course  the  conditionally  Markov  property  allows  one  to  reverse  the  process  and  define 
a  Markov  transition  density  p  from  a  reciprocal  transition  density  q, 

p(s,x;t,y)  =  q(s,x;t,y;T,xT). 


If  we  start  with  a  reciprocal  transition  density  q,  which  we  use  to  define  a  Markov 
transition  density  p  which  we  use  to  define  another  reciprocal  density  q  then  by  the 
second  Schrodinger-Jamison  relation,  q  =  q.  If  we  start  with  a  Markov  transition 
density  p  which  we  use  to  define  a  reciprocal  transition  density  q  which  we  use  to  define 
another  Markov  transition  density  p,  it  does  not  follow  that  p  =  p. 

Schrodinger  used  a  Markov  transition  density  p  to  construct  reciprocal  transition 
density  q.  With  this  and  an  end  point  density  p q  ^  he  was  able  to  construct  reciprocal 

processes.  Jamison  [6]  showed  that  the  resulting  reciprocal  process  is  actually  Markov 
iff  the  end  point  density  satisfies 
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=  »o(x°)  P(°>x°iT>xT) 

0  T 

for  some  nonnegative  functions  Tq(x  )  and  x,p(x  ). 

Jamison  [6j  also  studied  one  dimensional  stationary  Gaussian  reciprocal  processes. 
He  showed  that  covariance  r(t)  of  such  a  process  must  satisfy  a  second  order  linear 
differential  equation 


where  a  is  a  constant.  He  then  used  this  in  an  attempt  to  classify  all  such  processes  and 
this  program  was  successfully  completed  by  Chay  [8]  and  Carmichael-Masse- 
Theodorescu  [11]. 

The  author  became  interested  in  reciprocal  process  through  his  study  of  acausal  linear 
systems  [14]  driven  by  white  noise  and  satisfying  independent  random  boundary 
conditions  of  the  form 

dx  =  A(t)  x  dt  +  B(t)  dw 

v  =  V°  x(0)  +  V1  x(t). 

Here  x(t)  is  an  n  dimensional  Gaussian  process,  w(t)  is  a  standard  n  dimensional  Wiener 
process  and  v  is  an  n  dimensional  random  vector  independent  of  w(t).  We  assume  that 
the  above  boundary  value  problem  is  well-posed  so  that  the  Green's  matrix  T(t,s) 
exists.  We  can  express  the  solution  of  the  stochastic  differential  equation  as 

.T 

x(t)  =  *(t,0)  v  4 -  f  r(t,s)  B(s)  dw(s) 

J  0 

where  the  integral  is  a  Wiener  integral  and  $(t,s)  is  the  fundamental  matrix  solution  of 
x  =  Ax.  We  have  normalized  so  that  V^1  +  V*  $(T,0)  =  I. 

We  have  proved  [14]  that  the  solution  of  such  a  stochastic  boundary  value  problem  is 
a  reciprocal  process  and  we  speculated  that  every  Gaussian  reciprocal  process  is  the 
solution  of  such  a  stochastic  boundary  value  problem.  This  conjecture  was  motivated  by 
the  fact  that  every  Gaussian  Markov  process  is  the  solution  of  a  stochastic  initial  value 
problem,  i.e.,  V®  =  I  and  V*  =  0.  This  conjecture  is  not  true  and  this  led  us  to  discover 
a  theory  of  reciprocal  diffusions  and  stochastic  differential  equations  of  second  order. 

3.  Diffusions 

We  recall  the  Feller  postulates  for  a  Markov  diffusion  x(t).  First  some  notation,  let 
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x  denote  the  transpose  of  a  n  dimensional  column  vector  x  and  x  the  outer  product  of 
x  with  itself  x  =  xx  ,  this  is  n  x  n  matrix.  The  forward  difference  operator  d  is 
defined  by 

d+x(t;dt)  =  x(t+dt)  -x(t) 

where  dt  >  0  is  a  small  positive  quantity.  Typically  we  suppress  arguments  as  in  d+x. 
Conditional  expectation  given  that  x(t)  =  x  is  denoted  by 

Ex(t)  (•>  =  E  (*  I  x(0  =  x) 

The  symbol  O(dt)  denotes  a  function  of  x,t  and  dt  for  which  there  exist  c,<5>0  such 
that  if  dt  <  6  then  |  O(dt)  |  <  e  dt^  for  all  x  e  IRn  and  t  e  (0,T).  The  symbol  o(dt)^ 
denotes  function  of  x,t  and  dt  which  for  every  e  >  0  there  exists  a  6  >  0  such  that  if 
dt  <  S  then  |o(dt)k|  <  e  dtK 

A  Markov  process  x(t)  is  a  Markov  diffusion  if  there  exists  n  x  1  and  n  x  m  valued 
functions  f(x,t)  and  g(x,t)  such  that 

(MD1)  Prob  {|  x(t+dt)  -x|  >  t  |  x(t)  =  x}  =  O(dt) 

(MD2)  Ex^  (d+x)  =  f(x,t)  dt  +  o(dt) 

(MD3)  Ex(t)  (d+x)*2  =  (g(x,t))*2  dt  +  o(dt) 

(MD4)  Third  and  higher  centered  conditional  moments  of  dx  vanish  like  o(dt). 

The  interpertation  of  these  postulates  is  that  conditioned  on  x(t)  =  x,  the  forward 

increment  d+x  of  the  process  has  a  mean  value  approximately  equal  to  f  dt  and  variance 

♦2 

approximately  equal  to  g  dt.  In  other  words  x(t)  is  mean  differential  but  the 
individual  sample  paths  are  not  for  they  have  an  extremely  large  standard  deviation 
O(dt)1/2. 

From  these  postulates  one  can  deduce  that  the  density  p(x,t)  of  x(t)  satisfies  the 
Fokker— Plank  equation 


Moreover  using  the  Ito  stochastic  integral  we  can  realize  x(t)  as  the  solution  of  the 
stochastic  differential  equation 

d+x  =  f(x,t)  dt  +  g(x,t)  d+w 

x(0)  =  x° 


f 
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We  now  sketch  out  the  foundations  of  a  parallel  theory  of  reciprocal  diffusions  that 
we  have  recently  developed.  More  details  can  be  found  in  [21].  We  need  some  more 
notation.  We  define  the  centered  average,  centered  first  difference  and  centered  second 
difference  as 

x(t;dt)  =  x(t+dt)  +  x(t=dt) 
dx(t;dt)  =  x(t+dt)  ~  x(t^_tj 
d2x(t;dt)  =  x(t+dt)  —2  x(t)  +  x(t-dt) 

Frequently  we  suppress  argument  as  in  x(t)  or  dx.  We  also  introduce  another 
conditional  expectation  operation 

Ex(t)  (')  =  E  (  '  I  x(t;dt)  =  x) 

A  reciprocal  process  x(t)  is  a  reciprocal  diffusion  if  there  exists  n  x  1  valued  functions 
f(x,t)  and  u(x,t),  n  x  n  valued  functions  g(x,t)  and  r(x,t)  and  n  x  m  valued  function 
h(x,t)  such  that 

(RD1)  Prob  {  |x(t)  —  xj  >  c  |  x(t;dt)  =  x  }  =  O(dt) 

(RD2)  E-(t)  (dx)  =  u(x,t)  dt  +  o(dt) 

(RD3)  E_(t)  (d2x)  =  (f(x,t)  +  g(x,t)  u(x,t))  dt2  +  o(dt)2 

(RD4)  E-(t)  (dx)*2  =  \  (h(x,t))*2  dt  +  r(x,t)  dt2  +  o(dt)2 

(RD5)  E-(t^  (d2x)*2  =  2  (h(x,t))*2  dt  +  o(dt)2 

(RD6)  E-(t)  (d2x  dx*)  =  \  g(x,t)(h(x,t))*2  dt2  +  o(dt)2 

2 

(RD7)  Third  and  higher  joint  centered  conditional  moments  of  dx  and  d  x  vanish  like 
o(dt)2 

Basically  these  postulates  assert  that  the  first  and  second  joint  conditional  moments 
o 

of  dx  and  d  x  exist  and  have  the  indicated  expansions  in  power  series  in  dt.  They  define 
the  coefficients  f,g,h,u  and  x  of  the  power  series  and  they  imply  certain  relation  between 
these  coefficients.  These  definitions  and  relations  are  as  follows: 

(i)  The  dt  part  of  RD2  defines  u. 

(ii)  The  dt2  part  of  RD3  defines  f  +  g  u. 

(iii)  The  dt  and  dt2  parts  of  RD4  defines  h*2  and  r. 

(iv)  The  dt2  part  of  RD6  defines  g  h*2. 
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(v)  The  dt  part  of  RD3  vanishes. 

(vi)  The  dt2  part  of  RD5  vanishes  and  the  dt  part  is  four  times  the  dt  part  of  RD4. 

Any  process  satisfying  (RD1— 6)  is  called  a  second  order  diffusion. 

We  refer  to  u  as  the  mean  velocity,  p  u  as  the  mean  momentum,  f  +  g  u  as  the  mean 
acceleration,  h  as  the  noise  coefficient  and  p  r  as  the  mean  momentum  flux  of  the 
process  x(t).  A  related  quantity  p  a  —  p{ u  u*  —  j)  is  called  the  stress  tensor.  The 
reason  for  the  terminology  will  become  apparent  in  a  moment. 

A  reciprocal  diffusion  satisfying  RDl— 7  is  said  to  be  a  solution  of  the  second  order 
stochastic  differential  equation. 

d2x  =  f(x,t)  dt2  +  g(x,t)  dx  dt  +  h(x,t)  d2w 

where  w(t)  is  a  standard  m  dimensional  Wiener  process.  This  is  a  (partial)  mnemonic 

for  the  above  portulates.  In  particular  applying  E-^  (•)  we  obtain  RD3  from  RD2 

2  2  *2 
under  the  assumption  that  d  w  is  independent  of  x(t).  Applying  E-^  ( • )  to  (d  x) 

yields  RD5.  Finally  RD6  follows  from  applying  E-^  (•)  to  d  x  dx  using  RD4. 

To  get  a  feeling  for  these  axioms  it  is  convenient  to  introduce  another  conditional 
expectation 

Ex(t±dt)  (•)  =  E  (  I  x(t±dt)  =  x±v  dt  ) 

Suppose  x(t)  is  a  reciprocal  diffusion  which  also  satisfies  the  stronger  conditions. 

(RD3*)  Ex^t±dg  (d2x)  =  { f ( x , t )  +  g(x,t)  v)  dt2  +  o(dt)2 

(RD4*)  Ex(t±dt)  (d2x)*2  =  2(h(x,t))*2  dt  +  o(dt)2 

(RD6)  E-(t)  (d2x  dx*)  =  \  g(x,t)(h(x,t))*2  dt? 

+  f(x,t)  u(x,t)*  +  g(x,t)  r(x,t)  dt"*  +  o(dt)"* 

Then  x(t)  is  called  a  strongly  reciprocal  diffusion. 

Conditioned  on  x(t±dt)  =  x±v  dt  the  mean  sample  path  of  the  process  over  the  time 
interval  [t — dt,  t+dt.]  traces  out  a  parabola  in  (t,x)  space  passing  through  (udt,  x±vdfi 
and  with  second  derivative  equal  to  f(x,t)  +  g(x,t)  v.  Hence  the  mean  path  deviates 

o 

from  the  straight  line  between  (t±dt,  x±v  dt)  by  O(dt)  .  Compared  to  this,  the 
standard  deviation  of  sample  paths  from  the  mean  path  is  very  large,  O(dt)^2. 

Conditioning  on  x(t;dt)  =  x  rather  than  x(t)  =  x  is  crucial  to  the  above  development. 
Even  for  very  nice  processes,  such  as  an  Ornstein  Uhlenbeck  process,  the  quantity 
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E  ,  ,  (d  x)  is  0(dt)  rather  than  0(dt).  In  the  stochastic  mechanics  of  Nelson  [18]  the 
xftj 

dt  part  of  this  quantity  is  twice  the  osmotic  velocity.  Nelson's  current  velocity,  the  dt 
part  of  (dx)  is  generally  equal  to  our  mean  velocity  u(x,t)  from  (RD2). 

The  first  question  that  comes  to  mind  is  "Are  there  any  reciprocal  diffusions?".  In 
[21]  we  showed  that  answer  is  decidely  yes.  In  particular  we  showed  that  any  reciprocal 
Gaussian  process  with  smooth  covariance  R(t,s)  satisfying  certain  technical  conditions  is 
a  strongly  reciprocal  diffusion.  This  includes  such  Markov  processes.  For  a  Gaussian 
reciprocal  process  the  second  order  stochastic  differential  is  linear  of  the  form 

d~x  =  F(t)  x  dt^  +  G(t)  dx  dt  +  H(t)  d^w 


where 


f(x,t)  =  F(t)  x  =  (t,t)  -  G(t)  (t,t)  x 

dt 


g(x,t)  =  G(t)=fe(t,t)-^  (t.t)|fe(t,t)-ff  (t,t)| 
dtL  ds  0 


(h(x,t))‘2  =  (H(t))*2  =  -  §■  (t,t)  -  §  (t,t) 


The  principle  technical  conditions  are  that  R(t,t)  =  I  and  H(t)  is  invertible.  The  other 
quantities  u(x,t)  and  ir(x,t)  are  given  by 


u(x,t)  —  G(t)  x  —  ^  (M)  +  (t>t)  x 


r(x,t)  =  u(x,t)  u*(x,t)  —  a(x,t) 

■’(*,»)  -  -  2  ins  <‘.l>  +  W))  +  u<‘>  u*(‘)- 

All  of  the  above  evaluations  are  at  s  =  t- 

Suppose  x(t)  is  Gaussian  process  and  a  solution  of  the  first  order  stochastic  boundary 
value  problem 

dx  =  A(t)  x  dt  +  B(t)  dw 

v  =  V°  x(0)  +  V1  x(t) 
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in  the  sense  defined  above  using  the  Green's  matrix.  Assume  R(t,t)  =  I  and  B(t)  is 
invertible.  Then  x(t)  is  a  reciprocal  diffusion  satisfying  the  second  order  linear 
stochastic  differential  equation  above  with 

H(t)  =  B(t) 

G(t)  =  -  (A2(t)  -  A*2(t)  +  A(t)  -  A*(t))  (B(t)  B*(t))— 1 
F(t)  =  A2(t)  +  A(t)  —  G(t)  A(t). 

Because  of  the  complexity  of  these  relations,  it  is  possible  for  a  process  to  satisfy  a 
relatively  simple  first  order  equation  and  a  relatively  complicated  second  order  equation 
or  vice  versa.  The  latter  is  the  case  for  the  Brownian  Bridge  or  pinned  Wiener  process 
x(t)  which  satisfies  the  first  order  equation 

d+x  ~  (l^TJ  x  dt  +  d+w 

x(0)  -  0 

and  the  second  order  equation 
d2x  =  d2w 
x(0)  =  x(l)  =  0. 

The  density  p  of  a  Markov  diffusion  satisfies  the  Fokker— Plank  equation.  For  a 
strongly  reciprocal  diffusion  the  density  p,  mean  momentum  p  u  and  mean  momentum 
flux  p  j  satisfy  at  least  in  a  weak  sense  a  system  of  hyperbolic  conservation  laws  similar 
to  the  continuity,  Euler  and  kinetic  energy  balance  equations  of  fluid  mechanics.  They 
are 

!(pui)  =  p(f  +  gu)i-f-(p  Tjk) 

k 

and 

|t  Tjj)  =  P  (f  u*  +  uf*  +  g*  +  *8*)jj 

i  uj  uk  -  ^ij  uk  -  ^ik  uj  ~  ajk  ui)) 
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with  summation  on  repeated  indices  understood.  A  second  order  or  reciprocal  diffusion 
need  only  satisfy  the  first  two  of  these  equations. 

Suppose  we  consider  a  volume  with  boundary  in  x-space.  If  we  integrate  p  over  this 
volume  we  obtain  the  probability  measure  of  the  volume.  The  first  equation  states  that 
the  time  rate  of  change  of  the  probability  of  the  volume  is  equal  to  the  flux  of  particles 
through  the  boundary  due  to  the  mean  velocity. 

If  we  integrate  p  u  over  the  volume,  we  obtain  the  total  momentum  in  the  volume. 
The  second  equation  states  that  the  time  rate  of  change  of  momentum  in  the  volume  is 
equal  to  the  forces  acting  on  the  particles  in  the  volume  plus  the  net  flux  of  momentum 
through  the  boundary. 

If  we  integrate  p  r  over  the  volume  we  obtain  the  total  momentum  flux  in  the 
volume.  Physically  this  is  somewhat  hard  to  comprehend  but  for  smooth  processes  the 
contraction  ^  p  is  the  kinetic  energy.  Hence  we  view  ^  /?  Xjj  as  a  tensor  form  of 

kinetic  energy.  More  precisely,  if  A.  is  a  constant  n  vector  then  the  scalar  valued 

process  z(t)  =  A-x.(t)  has  kinetic  energy  equal  to  bp  t-.  AA-.  With  this  interpertation 
1  "  1J  1  J 

the  third  equation  states  that  time  rate  of  change  of  tensor  kinetic  energy  in  the  volume 

2  2 

is  equal  to  the  mean  work  done  on  the  particles  in  the  volume  by  the  force  d  x/dt 
acting  through  the  distance  dx  plus  the  flux  of  tensor  kinetic  energy  through  the  surface 
of  the  volume.  This  flux  is  due  to  mean  tensor  kinetic  energy  i  p  u-  u.  (called  internal 

energy)  transported  by  mean  velocity  uk,  random  tensor  kinetic  energy  -  \p  o jj 

transported  by  mean  velocity  uk  and  mixed  random/mean  kinetic  energy  transported  by 

random  velocity.  The  latter  represented  by  the  last  two  terms  of  the  flux  are  usually 
described  as  viscosity  or  stress  terms  in  fluid  dynamics.  They  represent  the  transport  of 
energy  due  to  random  jumps  of  particles  between  regions  of  differing  mean  velocity. 

The  third  equation  expresses  kinetic  energy  balance  at  the  standard  time  scale,  i.e., 
the  dt  part  of  E-^  (dx)  .  There  is  also  a  form  of  energy  at  a  fast  time  scale,  i.e.,  the 
*2 

dt  part  E-^  (dx)  .  We  call  this  hyperkinetic  energy  and  its  balance  is  described  by 
another  conservation  law 

ft  (P  h  h*)jj  =  §  (g  h  h*  +  h  h*  g*)^ 

-|^(/5(h  h*)jjUk) 


which  is  also  satisfied  by  sco-uo  order  diffusions. 

Notice  that  the  first  three  equations  can  be  viewed  independently  of  this  last.  We 
chose  the  name  "hyperkinetic"  to  suggest  a  hyperkinetic  child  sitting  at  his  school  desk 
whose  endless  fidgeting  is  to  no  net  effect  (except  possibly  on  his  teacher). 
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In  [21]  we  formally  derived  the  four  conservation  laws  from  the  postulates  of  a 
strongly  reciprocal  diffusion.  Although  they  can  be  thought  of  in  physical  terms,  they 
are  not  consequences  of  physical  principles  or  assumptions.  We  verified  that  these 
conservation  laws  are  satisfied  by  the  reciprocal  Gaussian  processes  discussed  above. 
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We  develop  a  theory  of  second  order  diffusion  processes  and  associated  stochastic 
differential  equations  of  second  order.  We  show  that  equations  of  evolution  of  the 
density,  mean  velocity  and  momentum  flux  are  a  family  of  first  order  conservation 
laws  similar  to  those  of  continuum  mechanics.  We  verify  that  the  theory  is  satisfied 
for  a  large  class  of  reciprocal  Gaussian  processes. 
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1.  RECIPROCAL  DIFFUSIONS 

One  of  the  most  beautiful  parts  of  modern  mathematics  is  the  rich 
and  wonderful  interplay  between  Markov  diffusion  processes,  linear 
parabolic  partial  differential  equations  and  stochastic  differential 
equations  of  first  order.  We  shall  describe  the  foundations  of  a 
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parallel  theory  involving  reciprocal  diffusion  processes,  nonlinear 
conservation  laws  and  stochastic  differential  equations  of  second 
order. 

Let  (Q,  J*,  Pr)  be  a  probability  triple  consisting  of  a  sample  space 
fi,  cr-algebra  of  events  and  probability  measure  Pr.  Let  E  denote 
expectation  with  respect  to  Pr.  Throughout  we  let  x(t)  denote  a 
stochastic  process  over  this  triple  defined  for  te[0,  T]  and  taking 
values  in  IR"*1.  We  assume  that 

T 

E  J  |x(t)|2dt<oo. 
o 


Given  O^to^t^T,  let  and  9S(t0,t ,)  be  the  a 

subalgebras  of  &  generated  by  x(r)  interior  to,  exterior  to  and  on 
the  boundary  of  the  interval  defined  by  t0  and  f,.  In  other  words 

<^'(t0,r1)  =  a{x(t):te[t0,f1]} 

^(t0,t1)  =  <T{x(t):t6[0,t0]u[t,,T]} 

We  denote  by  «^(t0, «i),  and  ^(£0>*i)  the  space  of  square 

integrable  random  variables  which  are  measurable  with  respect  to 
^(t0,ti)  and  respectively. 

The  concept  of  a  reciprocal  process  was  introduced  by  Bernstein 
[1]  following  ideas  of  Schrodinger  [2,3].  A  process  x(r)  is  reciprocal 
if  on  every  subinterval  of  [0,  T],  the  interior  and  exterior  are 
conditional  independent  given  the  boundary.  More  precisely,  if 
and  then 

E{4»p  1 39(t0,  t ,))  =  E(<p  1 8{t0,  t  ,))E(iA  |  8(t0, 1 1)). 

We  refer  the  reader  to  [4-17]  for  more  detailed  discussions  of 
reciprocal  processes.  The  have  also  been  called  quasi  Markov  or 
Bernstein  processes.  They  are  closely  related  to  conditionally  Markov 
processes.  Following  Schrodinger’s  original  motivation  and  Nelson’s 
stochastic  mechanics,  Zambrini  [20-22]  has  related  reciprocal 
processes  to  quantum  mechanics. 
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It  follows  immediately  from  the  definition  that  Markov  processes 
are  reciprocal  but  not  vice  versa  [5]. 

To  define  reciprocal  diffusions  we  must  introduce  some  notation. 
Given  a  process  x(t)  and  a  small  time  increment  dt>  0,  define  the 
centered  average  evaluation  x,  centered  first  difference  dx  and 
centered  second  difference  d2x  as 


x(t;  dt)  = 


x(t  +  dt)  +  x(t  —  dt) 


dx(t;  dt)- 


x(t + dt) — x(t — dt) 


d2x(f,  dt)  =  x{t  +  dt)  —  2x(t)  +  x(t  —  dt). 


Frequently  when  the  context  is  clear  we  suppress  the  argument  dt  as 
in  x(r),  or  both  t  and  dt  as  in  dx.  We  also  have  the  forward  d+x  and 
backward  d~x  first  differences 

d+ x(t;dt)=x(t+dt)— x(r) 

d~x(t\  dt)  —  x{t)—x(t~dt). 


In  contrast  to  the  standard  conditional  expectation  of  Markov 
theory. 


£,(•)  =  £*, ,(-)  =  £(-|x(t)  =  x) 


we  shall  utilize 


£*(•)  =  £*(  i;  <«)(  •)  =  £(•  I  x{t;  dt)  =  x) 
and  occasionally  the  stronger  conditioning 


Ei.dxi )  =  Ex<,±<io( )  =  E{-\x(t±dt)  =  x±v  dt) 
=  £(  •  |  x(t ;  dt)  =  x,dx(t,dt)  =  v  dt). 


396 


A.  J.  KRENER 


We  now  give  the  second  order  analogs  of  the  Feller  postulates  for 
a  first  order  diffusion.  A  stochastic  process  x(t)  is  a  second  order 
diffusion  if  there  exist  functions  f^x,  t),  gtj[x,  t),  h^x,  t),  u/x,  t)  for 


i,  j=\,...,m  such  that 

E^dXi)  =  u,{x,  t)  dt  +  o(dt)  (1.1a) 

E^dxi  dxj)  -  %hik(x,  t)hJk(x,  t )  dt  +  n{J(x,  t)dt2+ o(dt)2  ( 1 . 1  b) 

E^2Xi)={f,{x,t)+gi^x,t)u^x,t))dt2  +o(dt)2  (1.1c) 

E^d2xid2xJ)  =  2hik(x,t)hJk{x,t)dt  +o(dt)2  (lid) 

Ei(d2xi  dxj) = ig,*(x,  t)hkl(x,  t)hj,(x,  t)  dt2 + o(dt) 2  (l.le) 

The  higher  conditional  moments  of  dx  and  d2x  agree 
to  the  lowest  nonzero  powers  of  dt  with  those  of 
Gaussians  with  the  above  first  and  second  moments.  (110 


In  the  above  we  have  utilized  the  summation  convention.  Con¬ 
ditioned  on  x(t;dt)=x  for  fixed  x,t,  the  expression  o(dt)k  is  a 
deterministic  quantity  y(dt;x,t)  which  vanishes  faster  than  dt*  as 
dt-* 0  uniformly  in  x  and  t.  In  other  words  for  every  e>0  there  exist 
<5>0  such  that  \y{dt;  x,  t)|  <  e  dt*  for  all  dt<S,xeW  and  te(0,  T).  We 
denote  by  0(dt)*  a  quantity  y(df;x,t)  for  which  there  exist  e,S>0 
such  that  |y(dt;x,t)|<fidt*  if  dt<S. 

If  x(t)  is  both  a  reciprocal  process  and  a  second  order  diffusion 
then  we  say  it  is  a  reciprocal  diffusion. 

A  stochastic  process  x(t)  satisfies  the  second  order  stochastic 
differential  equation 

d2x  =  f(x,  t)  dt2  +  g{x,  t)  dx  dt  +  h(x,  t)  d2w  (1.2) 

where  w(t)  is  a  standard  m  dimensional  Wiener  process  if  x(t)  is  a 
reciprocal  diffusion  satisfying  (1.1a— f).  Actually  (1.2)  is  a  mnemonic 
description  of  (1.1)  in  the  same  way  that  the  first  order  stochastic 
differential  equation 

d+x  =  /(x,r)d£  +  /i(x,t)d+w  (1.3) 

is  a  mnemonic  for  the  axioms  of  a  first  order  diffusion. 
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Ejld+xi)  =  fl{x,t)dt+o{dt)  (1.4a) 

Ex(d + x( d + Xj)  =  hik{x,  t)hjk{x ,  t)  dt  +  o(dt)  ( 1 .4b) 

all  higher  centered  conditional  moments  of  dx  vanish 

like  o{dt).  (1.4c) 


In  particular  (1.1c)  asserts  that  conditional  mean  acceleration 
equals  f +gu  where  u  is  the  conditional  mean  velocity  given  by 
(1.1a).  Note  the  difference  between  the  conditional  expectations  in 
(1.2)  and  (1.4).  Conditioning  on  x(t\dt)  =  x  is  an  essential  part  of  the 
second  order  stochastic  calculus.  If  we  were  to  condition  on  x(t)  =  x, 
we  would  find  that  generally  £*(d2x)  is  order  dt  rather  than  dt2.  In 
fact  it  is  precisely  this  conditioning  which  distinguishes  our  work 
from  that  of  Nelson  [18]  and  Zambrini  [20-22].  In  Nelson’s 
stochastic  mechanics  the  order  dt  part  of  Ej(dx)  is  a  vector  field 
u(x,  r)  called  the  current  velocity  while  the  order  dt  part  of  1/2 Ex(d2x) 
is  another  vector  field  u(x,  t)  called  the  osmotic  velocity.  In  our  work 
the  order  dt  part  of  E£dx)  is  a  vector  field  u(x,t )  called  the 
conditional  mean  velocity.  For  a  Gaussian  process  with  a  smooth 
covariance.  Nelson’s  current  velocity  equals  our  conditional  mean 
velocity  and  we  suspect  that  this  is  true  whenever  both  exist.  On  the 
other  hand  in  our  theory  E^d2x)  is  postulated  to  be  of  order  dt2. 
Hence  the  coefficient  of  dt2  can  be  viewed  as  an  acceleration.  For 
this  reason  it  differs  from  Nelson’s  osmotic  velocity.  Zambrini’s  work 
also  uses  the  Nelson  framework. 

We  call  the  n  x  1  vector  field  u(x,  t)  the  mean  velocity.  The  density 
of  x(r)  is  denoted  by  p(x, t).  The  nxn  tensor  field  p(x, t)  jr(x, t)  is 
called  the  momentum  flux  tensor.  A  related  nxn  tensor  field,  p(x,  t) 
oi}{x,  t)  =  p(x,  t)(Uj(x,  t)  uj{x ,  t)  —  7r,-/x,  t)),  is  called  the  stress  tensor.  The 
reason  for  these  names  will  become  apparent  in  Section  Four. 

Formulas  (1.1,2)  suggest  that  the  random  parts  of  dx  and  d2x 
conditioned  on  x(t;dt)  =  x  are  given  by 

dx  =  dx  —  E^dx)  =  dx  —  u(x,  t)  dt 

d2x  =  d2x  —  £i(d2x)  =g(x,  t)dx  +  h(x,  t)  d2w. 

We  use  x*  denote  the  transpose  of  x  and  x*2  to  denote  the 
symmetric  square  or  outer  product,  x*2  =  xx*. 
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Note  that  u(x,t)  and  7i(x,t)  from  (1.1)  do  not  appear  explicitly  in 
(1.2).  As  we  shall  see  in  Section  4,  this  is  because  these  quantities  and 
the  density  p(x,t)  satisfy  a  system  of  nonlinear  conservation  laws 
determined  by  /,  g  and  h.  This  system  of  four  first  order  partial 
differential  equations  is  very  similar  to  the  equations  of  fluid  and 
continuum  mechanics.  They  express  conservation  of  probability, 
balance  of  momentum  and  balance  of  a  tensor  form  of  work  in  two 
time  scales.  They  replace  the  familiar  Fokker-Planck  equation  for  a 
first  order  diffusion  (1.3,4). 

Equations  (1.1a -e)  assert  that  the  conditional  moments  of  dx  and 
d2x  can  be  expanded  in  powers  of  dt  as  shown.  These  formulas  give 
names  to  the  coefficients.  The  only  constraints  on  the  coefficients  are 
found  by  comparing  (l.lb,d  and  e).  Equation  (1.1a)  asserts  that  the 
conditional  mean  velocity  exists  and  gives  it  a  name  u(x,  t).  Equation 
(1.1b)  describes  the  variance  of  dx.  The  order  dt  part  arises  from  the 
fluctuation  of  the  second  difference  d2w  ( t;dt )  of  the  Wiener  process 
that  appears  in  (1.2).  The  factors  of  1/2  and  2  in  the  dt  part  of 
(1.1b,  d)  are  explained  by 


E^dw)*2  =\I  dt 
Exid2w)*2  =  2Idt. 

The  second  order  part  ndt2  of  (1.1b)  has  a  deterministic  contribution 
u*2dt2  from  (1.1a)  and  a  stochastic  contribution  —  odt2  = 
(n  —  u*2)dt2  from  the  noise  throughout  [0,7/]. 

The  immediate  question  that  arises  is  “Are  there  any  second  order 
or  reciprocal  diffusions?”.  We  answer  this  in  the  affirmative  in  the 
next  section  by  showing  that  under  mild  technical  assumptions 
Gaussian  processes  with  smooth  covariances  are  second  order  dif¬ 
fusions,  and  Gaussian  reciprocal  processes  with  smooth  covariances 
are  reciprocal  diffusions.  Of  course  the  latter  includes  Gauss-Markov 
processes.  We  derive  explicit  formulas  for  the  quantities  appearing 
in  (1.1)  in  terms  of  the  covariance  matrix  of  the  process. 

In  Section  Three  we  explore  how  second  order  diffusions  trans¬ 
form  under  change  of  variables  and  in  Section  Four  we  derive  the 
conservation  and  balance  laws  which  are  described  above.  In  Section 
5  we  verify  that  Gaussian  reciprocal  processes  of  Section  2  satisfy 
these  laws. 
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2.  RECIPROCAL  AND  GAUSSIAN  PROCESSES 

Let  x(t)  be  a  Gaussian  process  defined  on  [0,  T\  and  taking  values 
in  R"XI.  For  convenience  we  assume  x(t)  is  zero  mean  and  we 
denote  by  R(t,s)  its  covariance  matrix, 

E(x(t))= 0 

£(x(r)x*(s))  =  /?(t,s). 

We  shall  assume  that  R(t,s)  is  a  smooth  (C°°)  function  of  t,s  in  the 
triangle  and  the  limits  of  R  and  its  partial  derivatives 

exist  and  are  continuous  on  the  closed  triangle.  Because 
R(t,s)  =  R*(s,t)  we  need  only  consider  R  in  this  triangle. 

We  shall  also  assume  that 


R(t,t)  =  l 


R(t  + 


I  R*(t  +  z,t-x)~ 1.  . 

is  nonsmgular  for  small  t>0 

/  J 


(2.1a) 

(2.1b) 


dR,  v  dR*  x  n 

1r(t,t)  +  -—(t,t)<  0. 

ot  os 


(2.1c) 


All  evaluations  of  R  and  its  partials  at  t=s  are  limits  of  values  in  the 
interior  of  the  triangle  0 Ss^t^T.  Notice  that  (2.1a)  is  merely  a 
normalization  assuming  R(t,t)  is  invertible.  Moreover  (2.1c)  essen¬ 
tially  implies  (2.1a,  b)  holds  for  almost  all  t. 

In  [15]  we  showed  that  any  stationary  reciprocal  Gaussian 
process  satisfying  (2.1)  has  a  C®  covariance  in  the  above  sense  and 
moreover  the  covariance  R(t  —  s)  =  R(t,s)  satisfies  a  pair  of  second 
order  matrix  differential  equations 

K  =  Gk  +  FR  (2.2a) 

K*=-Gk*  +  FR*.  (2.2b) 

We  now  extend  this  to  the  nonstationary  case.  We  refer  the  reader 
to  [14,15]  for  full  details. 
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Let  Q^s<t<T  then  by  the  Gaussian  and  reciprocal  properties 
the  covariance  R(t,s)  satisfies  for  r>0  sufficiently  small 

R’o+M-tir1 

|_K(t  +  T,t-T)  /  | 

T,S)-| 

t,s)  J 


R(t  —  - 
R(t  +  i 

We  define  Kt{t,  t)  in  the  obvious  fashion  so  that  this  becomes 
R(t,s)=lKt(t,z)K2(t, 


(2.3) 


.  nr^-T,sr 

t.  Til 


We  differentiate  this  twice  with  respect  to  t  to  obtain 


O^X^t)*^)] 


dt 


2  /?(t  4“  T,  s) 


+  2 


r  dKt/  dK2/  1 


—  X(r-r,s) 
^/?(r  +  r,s) 


d2x^  _x<?2x2 


dr 


■  '_![*(» +  t,s)_T 


By  (2.1)  and  arguments  similar  to  those  of  [15]  we  verify  that  the 
limits  of  K(t, r),  d/dx K{t,x)  and  (d/dx)2K(t, x)  exist  as  i-*-0.  In  particu¬ 
lar  K,(t,0)  =  %I  and  so  we  obtain  for  all  0^s<t<T 


a2 

dt 


dR 

~dt 


2R(t,s)  =  G(t)—(t,s)  +  F(t)R(t,s) 


where 


dK2i 


G(r)=-2f^-i(r,0)--^(t,0) 


(2.4) 


(2.5a) 
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(d2K,  d2K2  \ 

F(t)  =  -  f  -fV-  (f,  0)  +  -  - -/  (f,  0)  j .  (2.5b) 

But  notice  in  our  derivation  of  (2.4)  we  did  not  use  the  fact  that 
t>s,  we  only  used  the  fact  that  fe(f-T,f  +  r)  and  si(t -T,t  +  t). 
Hence  (2.4)  must  also  hold  for  0<t<s^T.  Since  R(t,s)-  R*(s, t)  we 
conclude  that  for  0<t  <s^T, 

d2  dR* 

—2R*(s,  t)  =  G(t)  —  (s,  t)  +  F(t)R*(s,  t). 

By  interchanging  the  symbols  t  and  s,  we  see  that  for  0<s<t^T 
we  have 

e2  dR* 

j-2  R*{t, s)  =  G(s)  —  (f, s)  +  F{s)R*(t ,  s)  (2.6) 


By  continuity  (2.4)  and  (2.6)  must  hold  for  0 ^s^t^T.  We  also 
obtain  alternative  formulas  for  G(t)  and  F(t),  namely 


G(t)  = 


d2R .  .  d2R * 

at2(r,t)  a*2  (u) 


a/? 

^(t,r)-— (u) 


(2.7a) 


F(0  =  -^(M)-G(0-^-(',t)  = 


a2/?* 

as2 


r(t,t)-G(t)--(t,t)  (2.7b) 
os 


We  define  H(t)H*{t)  by 

{dR  dR*  \ 

H(t)H*(t)  =  - (  0 -  (f.  0  j-  (2.7c) 

We  have  proved 

Theorem  2.1  Let  x(t)  be  a  zero  mean  Gaussian  reciprocal  process 
with  smooth  covariance  R(t,s)  satisfying  (2.1).  Then  R(t,s)  satisfies  the 
matrix  differential  equations  (2.4)  and  (2.6)  on  O^s^t^T  where  F(t) 
and  G(t)  are  given  by  {2.1a,  b). 

Every  Gauss-Markov  process  x(r)  with  a  smooth  covariance  is  an 
Ornstein-Uhlenbeck  process,  i.e.,  a  solution  of  a  first  order  linear 
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stochastic  differential  equation  of  the  form 


d+x  =  F(t)x  dt  +  H(t)  d+w. 

The  next  theorem  shows  that  every  Gaussian  reciprocal  process  x(t) 
with  smooth  covariance  satisfying  (2.1)  is  a  solution  of  a  second 
order  linear  stochastic  differential  equation 

d2x  =  F(t)x  dt2  +  G(t)  dx  dt  +  H(t)  d2w. 

Theorem  2.2  Let  x(f)  be  a  Gaussian  process  with  smooth  covariance 
R(tys)  satisfying  (2.1).  Then  x(t)  is  a  second  order  diffusion.  If  x(t)  is 
also  reciprocal  then  it  is  a  reciprocal  diffusion.  In  either  case 
f(x,t)  =  F(t)x,  g(x,t)  =  G(t)  and  h(x,t)  =  H(t)  of  (2.1)  and  u(x,t),  n(x,t) 
are  given  by 


u(x,  t)  —  U(t)x  (2.8a) 

(18b) 

7r(x,  r)  =  u(x,  t)u*(x,  t)  —  rr(x,  t)  (2.8c) 

1  /  d2R  d2R *  \ 

o(x,  t)  =  <7(t)  +  —  U(t)  U*(t).  (2.8d) 

Proof  The  proof  of  this  theorem  is  a  straightforward  exercise  in 
computing  conditional  expectations  of  Gaussian  random  variables. 
We  shall  sketch  the  details. 

By  assumption  (2.1) 


E(x(t))*2  =  R(t,  t)  =  I 


(2.9a) 


so 


dR  dR 

°-ar(M)+&(I’t) 


(2.9b) 
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and 


d2R/  „ d2R  d2R 

(t,  t)  +  2  (t,  t)  +  (t,  t). 


dt 


dt  ds 


8  s 


(2.9c) 


In  particular  (2.9b)  insures  that  HH*  (2.7a)  is  symmetric  and  V(t) 
(2.8b)  is  skew  symmetric. 

Next 


E(dx(t))*2  =  i(R{t  +  dt,t  +  dt)  -  R(t  +  dt,t  —  dt) 
-R*(t  +  dt,  t-dt)  +  R(t  —  dt,t  —  dt)) 
By  Taylor  series  expansion  and  (2.9)  we  obtain 


In  a  similar  fashion  we  obtain 

,  1 (8R  8R*  \ 

E(x(t))*2  =  1  +  2  (  (*« f)  +  (f>  t))dt  +  o(dt) 

E(d2x(t))*2  =-2 (t,  t)  -  ~  (t,  t)\ dt  +  o(dt)2 

1 [8R  8R*  \ 

E(dx(t)x(t)*)  =  -  /  --  (f,  t)  +  — -  (t,  t)jdt  +  o(dt) 

E{d2x(t)x(t)*)  =  ^  ( t ,  t)  +  o)  dt2  +  °(dt)2 

,  1  [82R  d2R*  \  , 

E(d2x(t)dx(t)*)=  -  -f  -^j(t,t)~  -^-(t.Oj* 


(2.10a) 

(2.10b) 

(2.10c) 

(2. 1  Od) 

(2.10e) 
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(M)  + 


d3R*\ 

dtds2) 


dt3+o(d>)3.  (2.1  Of) 


Therefore  we  obtain 

Ex{dx )  =  E{dxx(t)*)(E{x(t))*2)  -  lx 

=  ^  (t,  t)  +  ( t ,  t^j  x  dt  +  o(dt) 

=  U(t)xdt  +  o(dt)  (2.11a) 

and  in  a  similar  fashion 

E^d2x)  =  l-  t)  +  ('>  0^  dt2  +  o(dt)2 

= (F{t)  +  G(t)  U(t))x  dt2+o(dt)2.  (2. 1 1  b) 

One  can  also  show  that 

E£dxj{d2x)=(F(t)x  +  G(r)u)  dt2  +o(dt)2.  (2.11c) 

To  obtain  the  conditional  second  moments  of  dx  and  d2x,  we 
utilize  the  particular  property  of  Gaussian  random  variables  that  the 
conditional  variance  is  independent  of  the  conditioning  and  so 

Exidx  -  Exldx))*2  =  E(dx  -  Exldx))*2. 


Hence 


Exidx)*2  =  E(dx)*2  +(Exldx))*2  -  E(Exidx))*2 


lfdR/  dR\  \  ( 1  /  d2R  d2R *  ; 

2  a(M)-ir(M)JA+  2(s&(u)+a*<m), 


+  U(t)(xx*  -  I)U*(t)  1  dt2  +  o(dt)2 


+ (F(t)  +  G(t)  U(t))(xx*  -  I)  U*(t)j  dt 3  +  o(dt)3 

=  $G(t)H{t)H{t)*  dt2+o(dt)2.  (2.1 2d) 

If  x(f)  reciprocal  then  by  utilizing  the  sum  of  partials  of  (2.4)  and 
(2.6)  with  respect  to  s  and  t  respectively  we  obtain 

Exid2x  dx*)  =  \G(t)H(t)H*(t)  dt2 

+  ((F(t)  +  G(t)U(t))xx*U*(t) 

-G(r)cr(t))  dt3  +  o(dt)3.  (2.1 2e) 


Of  course  (l.lf)  follows  from  the  Gaussian  assumption.  Q.E.D. 

It  is  enlightening  to  apply  the  above  formulas  to  particular  classes 
of  reciprocal  processes.  For  example  suppose  x(t)  is  a  stationary 
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Gauss-Markov  process  satisfying  (2.1)  with  covariance 
R(t,s)  =  R(t—s).  It  is  well-known  that  R(t)  =  exp(At )  where  the 
spectrum  of  A  lies  in  the  open  left  half  of  the  complex  plane  and  that 
x(t)  satisfies  on  [0.  oo]  the  first  order  stochastic  differential  equation 


d*x  =  Axdt  +  Bd*w  (2.13a) 

x(0)~N(0,/)  (2.13b) 

where  w(t)  is  an  n  dimensional  standard  Wiener  process  independent 
of  x(0)  and  the  fluctuation-dissipation  relation  is  satisfied, 

A  +  A*  +  BB*  =0.  (2.13c) 

By  the  above  discussion  this  process  also  satisfies  the  second  order 
stochastic  differential  equation 

d2x  =  Fxdt2  +  Gdxdt  +  H  d2w  (2.14a) 

where 

F  =  AZ~GA  (2.14b) 

G  =  —(A2  —  A*2)(HH*)~ 1  (2.14c) 

HH*  =  BB *  (2.14d) 

Moreover 

U  =  j(A  —  A*)  (2.15a) 

o  =  j(A2  +  A*2)  +  UU*.  (2.15b) 

The  stationary  Gaussian  reciprocal  one  dimensional  processes 
have  been  completely  classified  [7,8,11].  See  also  [15].  The  covar¬ 
iance  R(t)  must  satisfy  (2.2)  which  in  the  case  of  scalars  reduces  to 

R  =  FR. 

There  are  three  cases  F>  0,  F  =  0  and  F<  0.  If  F>  0  there  are  after 
various  normalizations  only  3  possibilities. 
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l  a)  Ornstein-Uhlenbeck  Process:  R(t)  =  e~\  t> 0,  which  satisfies 
the  second  order  equation. 

d2x  =  xdt2 +  y/2d2w  x(0)~N(0, 1) 

and  V  —  0,  <x=  1.  Of  course  this  is  also  Markov  and  satisfies  the  first 
order  equation 


d+x  =  -xdt-\-s/ld+w. 

l.b)  Cosh  Process:  J?(t)  =  cosh (3  — 0/cosh ^  for  O^t^l  which 
satisfies 


d2x  —  xdt2  +  y/l  tanh  \  d2w  x(0)  =  x(l)~N(0, 1) 

and  (7=0,  <r=l.  This  process  is  not  Markov  but  it  does  have  a 
realization  by  a  stochastic  differential  equation  with  an  independent 
boundary  condition  [15]. 

l.c)  Sinh  Process:  R{t) = sinh (3 - 0/sinh 3  for  O^rgl  which 
satisfies 


d2x  =  xdt2  +  y/2  coth  3  d2w  x(0)=  —  x(l)~N(0, 1) 

and  (7=0,  <r=  1.  Again  this  is  not  Markov  but  can  be  realized  by  a 
stochastic  differential  equation  with  an  independent  boundary  con¬ 
dition  [15]. 

2)  Slepian  Process  [4]:  /?(0=l-2r  for  O^f^l  which  satisfies 


d2x  =  2d2w 


x(0)=  —  x(l)~iV(0, 1) 

and  V = 0,  o=Q.  This  is  not  Markov  and  again  has  a  stochastic 
boundary  realization  [15]. 

3.a)  Cosine  Process:  /?(t)  =  cosr  for  —co<t<co  which  satisfies 


d2x—  —xdt2  x(0)=  —  x(n)~N(Q,  1) 
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and  V  =0,  a=  —  1.  This  is  not  Markov  but  the  sample  paths  are 
completely  determined  by  x(r,),  x(r2)  where  t,-t2  is  not  a  multiple 

Of  71. 

3.b)  Shifted  Cosine  Process:  R(t)=  cos (t  +  t)/cost  for  O^t^n  —  2t. 
To  be  a  covariance,  t  must  satisfy  0^T<n/2.  The  process  x(f) 
satisfies 

d2x  =  —  xdt2  +  V/Ttan  rd2w  x(0)=  —  x(7i  — 2t)~/V(0,  1) 

and  U  =0,  er  =  1.  This  process  is  not  Markov  and  cannot  be  realized 
by  a  scalar  first  order  stochastic  differential  equation  with  initial  or 
boundary  condition. 

We  close  this  section  with  another  interesting  example.  The 
Brownian  bridge  or  pinned  Wiener  process  x(r)  is  obtained  from  a 
standard  Wiener  process  by  conditioning  that  x(0)  =  x(l)=0. 
Another  representation  is  x(t)  =  w(t)  —  tw(l)  where  w(t)  is  a  standard 
Wiener  process.  This  is  a  zero  mean  Gauss-Markov  process  with 
covariance  R(r,s)  =  s(l  —  t)  for  O^s^r^l.  It  satisfies  the  first  order 
differential  equation 


d+ x  —  — — — —  dt  d+ w  x(0)  =  0 

(i-o 

and  also  satisfies  the  second  order  differential  equation 
d2x  =  d2w  x(0)  =  x(l)  =  0. 

Note  that  this  is  essentially  the  same  differential  equation  as  that  of 
the  Slepian  Process. 


3.  CHANGE  OF  VARIABLES 

In  this  section  we  develop  some  formulas  that  we  shall  need  in  the 
next.  Suppose  x(t)  is  a  second  order  diffusion  satisfying  (1.1)  and 
(1.2).  Let  <j>(x,t),ijf(x,t)  be  C®  scalar  valued  functions  and  define 
<f>(t)  =  4>(x(t),t)  t//(r)  =  t^(x(r),  i).  We  compute  the  centered  mean,  first 
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difference  and  second  difference  of  <f>(t)  using  the  identities 

dx(t)=  ±(x(t  ±dt)  —  x(t))  (3.1a) 

d2x(t)  =  2(x(t)  —  x(t)).  (3.1b) 


Now 


$(t)  = 


4>{t  +  dt)  +  <f)(t  —  dt) 


=  4>(x(t),  t)  +  \(4>{t  +  dt)  -  <f>{x(t),  t)  +  <t>(t  -  dt)  -  <p(x(t),  0) 

1  3^  d) 

4 ht)  =  4>(x(t),t)  +  -  — ~ -(x(t),t)dxidxJ  +  0(dx)4  +  0(df)2.  (3.2a) 

The  symbols  0(dx)4  and  O(dt)2  denote  quantities  that  go  to  zero  as 
fast  as  |dx|4  and  dt2  respectively.  Next 

<j>(t  +  dt)-4>(t-dt)  (p(x{t),  t)  —  <p(x(t),  t) 
d<t>= - . - - + - z - • 


By  a  similar  Taylor  expansion  we  obtain 

dd>  d<b 

d<t>  =  ~  (x(t),  t)  dXj  +  (x(t),  t)  dt 

0Xj  dt 


+ 


i  e3d>(x{t),t) 
6  ex,  exj dxk 

1  d3  <f>(x(t),  t) 


dx, dxj dxk 


+  2  -J^dX‘dX‘d' 

1  d3<J>(x{t),t) 

+  --  — r — zrr—dxidr 

2  dx ,  dt 2 


+ 


1 


d50(x(t),f) 


1 20  cx,  dxj  dxk  dx,  dxm 


dx,  dxj  dxk  dx,dx„ 


+  O(dx)6  +  0(dx)4  0(dt )  +  O(dx)2  O(dt)2  +  O(dt)3  (3.2b) 
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d2<f>  =  ~  (x(t),  t)  d2xi  +  ^  (x(t), 0 dXj-^d2Xi d2x^j 

d2<b  _  d2d>  , 

+  2  (x(0, 0  dx  f  dt  +  -£  (x(t),  t)  dt2 

dt  0Xi  dr 

i  fimtu) 

24  ax^Xjdx,  ,d  ‘‘  1 

+Ti  aSSk, 

,  i  d*mt),t)  J  ,  J 
+  3  ^x^dxjdx^dt  dX‘ dXj  dXk 

+  T~f  ^d^d^d  ^  d2xjd2xk  d2)C‘  d2xm 
1920  ox,-  dx^  0Xi  dxm 


+  0{d2x)6  +  O(dx)6  +  0(dx)*0(dt)  +  0(dx)20(dt)2  +  O(dt)3  (3.2c) 


Hence  it  follows  from  (1.1)  that 


1  d26 

£*(<?(*))  =  <P  +  ^  dx~fa' hikhjkdt  +  o(dt) 
cd)  dd> 

Ex(d<t>)  =  ^~-Uidt  +  -—dt  +  o(dt) 

CX:  dt 


(3.3a) 


(3.3b) 


E^d2<f>)  —  z^ifi+gijUj)  dt2 


d2d>  ,  d2d>  ,  d2d>  , 

+  t-~  nik  dt2  +  2  u,  dt2  +  -r^dt2  +  o(dt)2  (3.3c) 
dXi  dxk  dt  dxf  dr 


Exid<j>di{/)  =  ~<~  ~A,\hikhjkdt  +  nijdt2)  +  d^  ~dt2+o(dt)2  (3.3d) 
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E£d2<t>  d2tl/)  =  2 ~hirhJr  dt  +  o{dt)2 

0Xi  dXj 


l  dd>  d\l/  ,  dil/ 

Exid  <P  #) = ^ 7  SikKK  dt  +  ukhirhjr  dt 


2  dXi  dxj°“  'Kr  jr 
d2<t>  dip 


dt  dx(  dxj 


hirhjrdt2+o(dt)2. 
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(3.3c) 


(3.30 


The  right  sides  of  the  above  are  evaluated  at  (x(r),t)  =  (x,t).  Note 
that  these  are  not  in  the  form  of  (1.1)  in  that  the  mean  differences  of 
< p(t )  are  conditioned  by  x(t)  rather  than  $(r). 


4.  CONSERVATION  LAWS 

Suppose  x(t)  is  a  Markov  diffusion  satisfying  the  first  order  stochas¬ 
tic  differential  equation. 


d+x  =  fdt  +  hd+w.  (4.1) 

The  probability  density  p(x,t)  of  x(f)  satisfies  the  Fokker-Planck 
equation. 


lp+^pf‘)-ie^jph^0-  (42) 

This  is  a  second  order  parabolic  partial  differential  equation. 

Suppose  x(f)  is  a  strongly  reciprocal  diffusion  satisfying  the  second 
order  stochastic  differential  equation. 

d2x  =  /  dt2  +  g  dx  dt  +  h  d2w  (4.3) 

then  we  shall  demonstrate  that  the  density  p(x,  t),  mean  velocity 
u(x,  t)  and  momentum  flux  tensor  pn(x,  t)  satisfy,  at  least  in  the  weak 
sense,  a  system  of  conservation  laws,  very  similar  to  those  of 
continuum  mechanics. 


d  d 

-p  +  --{puk)  =  0 

dt  dxk 


(4.4a) 
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d  d 

(pUj)  =  p(fi+gikuk )  -  ( pnik )  (4.4b) 

ct  cxk 

1  )  s 

((lt  (ph„hjr)  =  ~  (xMlj'+gjMi)  -  dx  {pUkKhjX  (44c> 

In  addition  we  shall  show  in  Section  5  that  at  least  for  the 
reciprocal  Gaussian  processes  of  Section  2  an  additional  conser¬ 
vation  condition  must  hold 

j(pnij)  =  p(fiUj  +  Ujfi + giknkj  +  nikgjk) 

? 

-  .  {piu.UjUl  -  OijUk  -  aikUj  -  <7 jkUj)).  (4.4d) 

cxk 

Since  aij=uiuj-itij,  (4.4a, b,d)  appear  to  be  a  complete  set  of 
equations  for  the  unknowns  p,  u,  n  in  terms  of  /  and  g  in  the 
Gaussian  case.  But,  we  doubt  that  (4.4d)  holds  for  all  reciprocal 
diffusions. 

Before  we  derive  these  equations,  let's  take  a  look  at  their 
meaning.  Equation  (4.4a)  express  the  conservation  of  probability 
under  the  mean  flow  described  by  u.  This  corresponds  to  conser¬ 
vation  of  mass  in  continuum  mechanics.  A  similar  equation  relates 
the  density  and  current  velocity  of  Nelson  [18], 

Equation  [4.4b)  expresses  the  balance  of  momentum  pu.  If  we 
integrate  the  left  side  over  a  volume  in  x-space  we  obtain  the  time 
rate  of  change  of  momentum  within  the  volume.  The  integral  of  the 
right  side  of  (4.4b)  has  contributions  from  two  sources.  The  first 
integral  involving  p{fr*-gikuk)  is  the  change  of  momentum  due  to  the 
mean  acceleration  of  the  particles  inside  the  volume,  the  random 
acceleration  produces  no  net  change  of  momentum.  The  integral  of 
the  second  term  over  the  volume  can  be  converted  to  a  surface 
integral  over  its  boundary  by  Stokes'  Theorem.  The  integral  over  the 
surface  bounding  the  volume  of  pnik  contracted  with  outward  unit 
normal  is  the  total  flux  through  the  boundary.  Recall  the  definition 
(1.1b)  of  nu  as  the  dt2  part  of  E/dXjdxj).  This  tensor  has  a 
deterministic  and  a  random  component,  pnij  =  pUjUj  —  pOij  and  each 


RECIPROCAL  DIFFUSIONS 


413 


contributes  to  the  momentum  flux.  Notice  that  the  order  dt  part  of 
E^dxidxj)  in  (1.1b)  does  not  contribute  anything  to  the  momentum 
flux.  Intuitively  this  is  because  these  changes  are  so  fast  and  so 
random,  they  cannot  transport  momentum. 

In  continuum  mechanics  the  contraction  jpnu  describes  the 
density  of  kinetic  energy.  This  has  two  parts,  the  first  ipu.u,  due  to 
deterministic  part  of  the  velocity  and  the  other  —  (p/2)au  due  to  the 
random  part.  The  latter  is  frequently  called  the  internal  energy 
density. 

The  tensor  \ pn tj  describes  the  density  of  kinetic  energy  in  every 
component  of  the  x  process.  If  A,  is  a  constant  n  vector  then  the 
scalar  valued  process  z(t)  =  /,x,(x)  has  kinetic  energy  density  given  by 
{piiijXiXj.  For  this  reason  we  call  jpntj  the  tensor  kinetic  energy. 

There  is  an  alternative  definition  of  7r0  as  5 E^d*  xtd~  Xj  + 
d+  Xjd~ xi)  =  nijd2  +o(dt)2  which  reduces  to  the  standard  one  for 
smooth  process.  Based  on  this  and  Eq.  (4.4c)  which  we  discuss  in  a 
moment,  we  define  the  kinetic  energy  density  to  be  ( p/2)nu .  This 
definition  of  kinetic  energy  is  similar  in  spirit  but  different  from  that 
of  Guerra-Morato  [19].  It  is  interesting  to  note  that  some  of  the 
examples  of  Section  2  have  negative  or  zero  kinetic  energy.  For  the 
Ornstein-Uhlenbeck,  Cosh  and  Sinh  process,  n=  —  1,  and  for  the 
Slepian  process  7t=0.  The  Brownian  bridge  has  both  negative  and 
positive  kinetic  energy  depending  on  x  and  t. 

Equation  (4.4d),  which  may  hold  only  for  Gaussian  processes,  is  a 
tensor  form  of  the  balance  of  kinetic  energy  and  work.  In  other 
words  Eq.  (4.4d)  expresses  the  balance  of  kinetic  energy  and  work 
for  every  scalar  process  z(r)  =  A,x,(t).  The  momentum  flux 
or  tensor  kinetic  energy  is  (p/2)ku,  the  dt2  part  of  (p/2)E^dXidXj). 
The  tensor  part  of  the  rate  of  work  done  or  power  is  the  dt2  part  of 
(p/2)E^d2xidXj  +  d2xidxj)  which  explains  the  first  term  on  the  right 
side  of  (4.4c).  The  second,  flux  term  represents  the  flow  of  tensor 
kinetic  energy  across  the  boundary  of  the  volume  under  consider¬ 
ation.  This  flux  has  contributions  both  from  the  deterministic  and 
random  parts  of  the  motion.  The  first  term  u{ujuk  represents  the  flux 
due  to  strictly  deterministic  motion,  the  others  due  to  a  mix  of 
deterministic  and  stochastic  motion.  In  continuum  mechanics, 
(p/2)otiuk  is  the  flux  of  internal  energy  and  pa/ku,  is  the  flux  of  energy 
due  to  viscosity  or  stress.  In  our  stochastic  model,  (p/2)<ruuk  is  the 
flux  of  random  kinetic  energy  transported  by  the  mean  velocity  and 
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paikUi  is  the  flux  of  energy  due  to  the  random  motion  of  particles 
between  regions  of  differing  mean  velocity. 

But  (4.4d)  only  expresses  a  balance  between  tensor  kinetic  energy 
and  tensor  work  terms  of  size  dt2.  The  quantities  involved  also  have 
terms  of  size  dt  and  the  balance  of  these  is  expressed  by  (4.4d).  We 
view  phirhjr/4  as  the  tensor  form  of  the  hyperkinetic  energy  due  to 
hypervelocity  part  of  dx,  namely,  '3x  =  0(dt)1/2.  The  tensor 
(p/2)(gAAr+sAA>)  is  the  hyperpower  and  (p/2)hirhjruk  is  the  flux 
tensor  of  hyperkinetic  energy.  Of  course  this  extra  equation  leads  to 
an  overdetermined  system  of  equations  for  p,  u,  and  n  but  if  we 
consider  hirhjr  as  an  unknown  also,  this  problem  disappears.  An 
interesting  question  which  we  don’t  address  is  that  of  boundary 
and/or  initial  conditions  for  (4.4). 

In  particular,  (4.4c)  implies  that  we  cannot  find  processes  satisfying 
the  second  order  stochastic  differential  equation  (1.2)  for  arbitrary 
choices  of  f  g  and  h.  Notice  that  if  h  is  constant  in  x  and  t,  (4.4c) 
and  (4.4a)  imply  that  the  tensor  field  g(x,  t)  is  skew-symmetric 
relative  to  the  symmetric  tensor  field  h*2{x,t),  i.e., 

ghh*  4-  hh*g*  —  0.  (4.5) 

We  shall  derive  Eqs.  (4.4a,  b,c)  using  no  a  priori  assumptions  of 
conserved  quantities.  Rather  they  shall  follow  from  basic  mathemati¬ 
cal  facts.  We  warn  the  reader  that  our  derivation  is  somewhat 
formal,  we  shall  interchange  limiting  operations,  neglect  small  quan¬ 
tities,  etc.  In  the  next  section  we  shall  verify  that  the  reciprocal 
Gaussian  diffusions  treated  in  Section  2  satisfy  (4.4a,  b,c)  and  also 
(4.4d). 

Before  we  start  we  list  some  basic  formlulas  about  centered 
differences  that  will  be  useful.  Let  x(r),  y(r)  be  n-dimensional 
processes  defined  on  [0,  T].  Suppose  0<to<t1<T  and  tr  =  T0  + 
(r  —  j)dt,  r i=tN  +  jdt.  Then 

Z  dx(tr;  dt)  =  x(rl;  dt/2)  —  x(r0;  dr/2)  (4.6a) 

r=  1 

I  d2x(tr;  dt)  =  2(dx(xi ;  dt/2)  - dx( t0;  dt/2))  (4.6b) 

r  =  1 


I 
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d(xy*){t\ dt)  =  x(t;  dt)  dy*(t ;  dt)  +  dx(t;  dt)y*(t;  dt )  (4.6c) 

dx(t;  dt)  =  Jx(t;  dt)  =  j  dx(t;  2  dt)  (4.6d) 

d(dx)(t,dt)  =  id2x(f,2dt)  (4.6e) 

d(dx  dy*)(t;  dt)  =  g( dx(t ;  2  dt)  d2y*(f,  2  dt) 

+d2x(t;2dt)dy*{t,2dt)).  (4.6f) 


Let  x(t)  be  a  second  order  diffusion  satisfying  (1.1a -f)  with  density 
p(x,t),  mean  velocity  u(x,t),  momentum  flux  p(x,t)  n(x,t)  and  stress 
p(x,t)  <r(x,  t).  We  assume  that  as  |x|-»oo,  p  goes  to  zero  faster  than 
every  rational  function  of  |x|  uniformly  for  all  re[ 0,7].  We  also 
assume  that  |u|,  |rt|  and  |oj  are  bounded  above  by  some  polynomial 
in  |xj  for  all  re[0,  T\.  Let  <p(x,t)  be  a  smooth  scalar  valued  function 
also  bounded  in  norm  by  a  polynomial  in  |x|  and  suppose  <f){t,x) 
vanishes  off  some  closed  subinterval  of  (to.Tj).  Finally  we  assume 
that  density  p(x,t,dt)  of  x(t;dt)  converges  to  the  density  p{x,t)  of  x(t) 
as  dt-* 0. 

Using  (4.6a)  we  have 

N  N 

0=££  d(f>(tr;dt)  =  E^Exid<P(tr;dt)) 

r=  1  1 

We  employ  (3.3b)  and  let  dt-+  0  to  obtain 

0=ii(^+f)‘,dtdx 

Integration  by  parts  yields  a  weak  form  of  (4.4b), 

0=\\4,{t+£rk)d,dx- 

In  similar  fashion  (4.6b)  yields 

0  =  ~  E  £  d2<p(tr- dt)  =  j  E£  Exld24>(t, ■  dt)) 

dt  r =  |  ut 
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We  employ  (3.3c)  and  let  dt-* 0  to  obtain 


f  f  (d<f>  d2(f>  d2(j)  d2<(>\  .  , 

J  J  Wf +g“Uj) + +  + 


Integrating  by  parts  yields 


By  (4.4a)  this  reduces  to  a  weak  form  of  (4.4b), 

0=}{s,  +g»Uj)  ~  ht  ('m“)  -  f,  <pu))d' dx 

Finally  we  start  with  (4.6f)  applied  to  <j> ,  which  we  sum  and  divide 
by  dt  to  obtain 


0-  —  EX  J‘MV2dt)i<Hv2il) 
odt  r=i 


=  AeZ  EiWVft,;  2  it)  iMt,:  2  it)). 

O  dt  ! 


So  by  (3.3f) 


^  1  dd>  d(b  d2<j)  c<t> 

0=e^'2Ff£  + FFTx  ff  Ut"  * 

I  L  l/Aj  l/Aj  t'Aj  C  A^  (  A  j 

d2<b  dd> 

+  t:  ^  -  Fr~hirhjrdt  +o(dt). 

OtCXj  OXj 


As  df-+0  we  obtain 


ff/1  d<b  d<t>  d26  d<t>  c2<p  dd>  \ 

°=  U/  J  ,  «M>+  -  ,  ,  hirhjr)pdtdx. 

J  J\2  oxi  OXj  oxioxk  cxj  (tcXjdXj  ) 
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We  symmetrize  this  with  respect  to  i  and  j, 


1  f  f  / 1  8(f)  8(p 

0=2)){2^l{g*K'h*+e‘M') 


and  integrate  by  parts 


o=^n^^(itaAr',"+?AA') 


8  8 

~  fa~(pUkh"hj')~  Jt(Phirhjr))dtdx 


ijY 


which  we  recognize  as  a  weak  form  of  (4.4c). 


5.  RECIPROCAL  AND  GAUSSIAN  PROCESSES 
REVISITED 

In  this  section  we  verify  that  the  Gaussian  (reciprocal  Gaussian) 
processes  discussed  in  Section  2  satisfy  the  three  (four)  conservation 
laws  of  Section  4.  Let  x(t)  be  a  Gaussian  process  with  smooth 
covariance  R(t,s)  satisfying  (2.1),  (2.4)  and  (2.6)  where  F(f),  G(r)  and 
H(t)  are  defined  by  (2.7)  and  u{x,  t),  n(x,t)  and  <r(x,r)  by  (2.8).  Since 
R(t,t)  =  I  (2.1a), 

p(x,t)  =  (2n)-nl2  exp  —  i|-x|2 

which  satisfies 


^  =  0 

8t 

(5.1a) 

8p 

T  =  ~pxk 

(5.1b) 

consider  the  conservation  of  probability  (4.4a) 


SHX  H  <• 
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=  p[F  +  GU]ijXj  —  pGikxk 

+  PlUxMx*U*x)-plUx-]iUkk-piUUljXJ 
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which  equals  the  left  side  by  the  skew  symmetry  of  U. 

Next  we  verify  (4.4d)  for  reciprocal  Gaussian  processes.  In  this 
case  (4.4d)  becomes 

—  pnij  =  p[Fxx*U*  +  Uxx*F*  +  Gn  +  7tG*]i; 

-  P(  UirUjm  Uklxp cmx,  -  <TijUkrxr 

-aikU  jrxr-oJkUirxr).  (5.5) 

It  is  convenient  to  break  up  each  side  of  this  equation  imo  terms 
that  are  time  varying  multiples  of  p  and  terms  that  are  time  varying 
multiples  of  pxx*,  there  are  no  others.  We  refer  to  these  as  constant 
and  quadratic  terms. 

On  the  left  side,  the  constant  terms  are 

p[  d3R  d3R  d3R*  d3R*  1  Vd  1 

2\_8sdt2  +  dtds2  +  dsdt2  +  dtds2\ij  P\^dt  J,/ 

By  (2.4)  (2.6)  this  equals 

pi"  cfd2R  d2R*\  /  d2R  d2R*\  m 
2  L  \dt  ds  +  dtdsj^  \dt  ds  +  dt  ds  )  ^ 


By  (2.8b, d)  and  (5.4)  this  equals 

plGlUU* -o)  +  (UU*  -a)G*  +  FU*  +  U F*lj 

-p[_(F  +  GU  +  UU*-o)U*  +  U(F  +  GU  +  UU*-c)*lj 

=  pUU-G)G  +  <T(U-G)*lj 
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which  equals  the  constant  terms  on  the  right  side  of  (5.5). 

The  quadratic  terms  on  the  left  side  of  (5.5)  are 

p  ~  (Uxx*U*)ii  =  p[(F  +  GU -  a  +  UU*)xx*U * 
at 

+  Uxx*(F  +  GU  —  a  +  UU*)~\ij 
On  the  right  side  (5.5)  the  quadratic  terms  are 
p[_(F  +  GU)xx*  U*  +  Uxx*(F  +  GU)*lj 

+  pxkUkkx,[Uxx* U*lj - pxk{aik  UJrxr  +  a]k  V irxr) 
-p[VUxx*U*  +  Uxx*U*U*]u 

~pUkklUxx*U*lj 

=  p\_(F  +  GU —o+UU*)xx*U* 

+  Uxx*(F  +  GU-o+UU*)*lj 
as  desired  to  prove  (5.5). 

Finally  we  verify  (4.4c)  for  Gaussian  processes  which  reduces  to 

y(pHH*)  =  ?-(GHH*  +  HH*G*)- ~(pUkrxr)HH*.  (5.6) 

ot  2 

By  (5.2)  this  becomes 

d 

-  HH*  =  \  (GHH*  +  HH*G*). 

By  (2.7c)  the  left  side  equals 

/ d2R  d2R  d2R*  d2R*\ 

\dr2  +  dtds  dtds  ds2  ) 


Since  R(t,t)  =  I,d2/dt2R  =  0  and  so 
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1  d2R  d2R  _  1  d2R 

2  dt2  +  dtds  2  ds2 

This  reduces  the  left  side  of  the  above  to 

_1/^R_^_^R*  d2R*\ 

2\dt2  ds2  ds2  dt2  ) 

which  equals  the  right  by  (2.7,  a,  c). 

6.  CONCLUSION 

We  have  described  a  theory  of  stochastic  differential  equations  of 
second  order  and  have  demonstrated  that  the  theory  is  not  vacuous, 
it  includes  the  reciprocal  Gaussian  processes  which  satisfy  some  mild 
assumptions.  We  have  also  demonstrated  that  the  density,  mean 
velocity  and  momentum  flux  obey  a  system  of  nonlinear  conser¬ 
vation  laws  similar  to  those  of  continuum  mechanics. 

Obviously  considerable  work  remains  to  be  done  including  the 
following. 

1)  A  theory  of  stochastic  integration  for  second  order  stochastic 
differential  equations. 

2)  Further  study  of  nonlinear  second  order  stochastic  differential 
equations  and  non-Gaussian  reciprocal  processes. 

3)  Possible  applications  in  statistical  mechanics,  continuum  and 
fluid  mechanics  and  quantum  stochastic  mechanics. 
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Abstract  A  first  degree  approximation  by  a  linear  system  is  the 
standard  approach  for  treating  most  nonlinear  systems.  Exact 
transformation  of  certain  nonlinear  systems  into  linear  systems  is 
possible  under  nonlinear  state  feedback  and  coordinate  change,  as 
shown  by  Jakubzcyk  and  Respondek  [13J  and  Hunt  and  Su  [10]. 
The  approximation  of  nonlinear  systems  to  higher  degrees  by 
linear  systems  has  been  treated  in  [16]  and  recently  in  [17].  In  this 
paper,  we  develop  a  method  of  solution  to  find  such  higher  degree 
approximations  by  reducing  the  linearization  problem  into  the 
solution  of  a  set  of  linear  equations.  We  suggest  a  solution  that,  in 
some  sense,  minimizes  the  error  in  the  approximation. 

1.  Introduction 

In  the  analysis  of  scientific  and  engineering  systems,  one  often 
encounters  situations  which  do  not  lend  themselves  to  exact 
solutions  by  conventional  methods.  The  assumption  of  linearity  in 
most  control  system  models,  for  example,  is  an  oversimplification 
at  best,  and  it  reflects  the  difficulties  one  would  rather  avoid  in 
dealing  with  an  otherwise  nonlinear  model.  Seldom  a  technique 
can  be  found  to  solve  a  given  nonlinear  problem  exactly.  Since 
the  control  system  designer  is  equipped  with  powerful  methods 
and  tools  for  attacking  linear  control  systems,  the  motivation  for 
“linearizing”  a  nonlinear  problem  is  clearly  very  strong. 

Therefore,  whenever  possible,  a  nonlinear  control  problem 
must  be  suitably  transformed  to  bring  it  into  an  appropriate  form 
that  enables  the  implementation  of  linear  control  design 
techniques.  However,  the  systematics  of  such  modifications  by 
transformation  are  usuallly  not  self-evident.  The  simplest  of 
these  modifications  is  a  first  degree  linear  approximation  by 
calculating  a  series  expansion  at  a  nominal  operating  point.  The 
validity  of  this  approximation  depends  on  the  relative  size  of  the 
second  degree  terms.  In  systems  where  nonlinearities  are  strong, 
the  higher  degree  terms  cannot  be  neglected,  and  the 
approximation  fails. 

The  earliest  example  on  the  question  of  whether  a 
nonlinear  system  can  be  equivalent  to  a  linear  system  under  some 
group  of  transformations  such  as  change  of  coordinates  was 
solved  by  Poincard  [19].  Various  researchers  in  [6,8,9,15,20,21] 
discussed  the  question  of  when  a  nonlinear  control  system  can  be 
transformed  into  a  linear  system  by  a  change  of  state  coordinates. 
Jakubczyk  and  Respondek  [13],  and  Hunt  and  Su  [10,11,12] 
independently  considered  the  full  state  feedback  and  coordinate 
change  problem.  Related  work  also  appeared  in  [2,22,23,24].  In 
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[16],  Krener  investigated  an  approximate  linearization  considering 
the  second  and  higher  degree  terms  in  the  truncated  series 
expansion  of  a  the  vector  field,  and  proved  a  weakened  version  of 
the  Hunt-Su  linearization  condition.  In  [17],  further  results  in  an 
attempt  to  solve  for  the  resulting  transformations  were  presented. 
An  application  for  nonlinear  observers  also  appeared  in  [4J. 

The  objectives  of  this  paper  are  to:  1), Present  a  solution  to 
the  approximate  linearization  problem,  2)  Suggest  a  method  to 
solve  the  Homological  equations  to  minimize  the  error  in  the 
approximation  in  some  sense.  For  further  work  see  [14]. 

2.  Higher  Degree  Approximations  to  Autonomous 
Systems:  Normal  Form  Theorem 

In  this  section,  the  normal  form  theorem  of  Poincarf  will  be 
introduced.  The  approximate  linearization  problem  for  a  control 
system  will  be  formulated  later  in  a  similar  spirit  As  a  reminder 
of  the  connection  between  the  two,  we  continue  to  use  the  term 
“Homological  Equations”  (after  Arnold  [I]), 

Let  us  consider  an  autonomous  system: 

*=/(*)  (la) 

*(0)=jr\  (lb) 

where  x  r  IP  "  and  the  system  is  assumed  to  be  at  rest  at  the 
origin,  i.e.  /(0)  =  0.  Without  loss  of  generality  we  will  assume 
x'  =  0.  First,  consider  the  linearization  of  (1)  at  r": 
x-Fx  (2a) 

5/ 

F-f-m  (2b) 

dx 

We  will  seek  a  coordinate  change  for  (1)  of  the  form  identity  plus 
higher  degree  terms,  such  that  the  resulting  system  will  agree  with 
(1)  up  to  an  error  of  degree  0(x)p+1  where  p  is  the  degree  of 

approximation.  The  following  treatment  is  for  p  =  2.  The  results 
can  be  easily  generalized  to  any  arbitrary  degree  p  by  induction. 

We  assume  a  transformation  of  the  form: 

z  =  x- $(2\jc)  (3) 

(2) 

where  z  denotes  a  new  set  of  coordinates.  $  (x)  is  a  polynomial 

of  degree  2.  The  function  fix)  in  (la)  is  expanded  in  a  series: 

fix)  =/(,)(x)  +/<2)(x)  +_Q(x)3 

=  Fx  +  /%  +  OOc)3  (4) 

(2) 

The  goal  of  the  transformation  (3)  is  to  choose  $  (x)  such  that  in 
r  coordinates  the  dynamics  of  the  system  is  represented  by 


i  =  Fz  +  O(z)3  (5) 

namely  the  second  degree  terms  in  the  series  expansion  (4)  vanish 
under  the  coordinate  change.  We  take  the  time  derivative  of  (3) 
and  using  (la),  (4)  and  (5)  evaluate  each  side  by  ignoring  0(x)3 
and  higher  terms: 


F(x  -  4»<2,(.r)) 


(2) 

..  ,Q),  ,  a*  \x)  „ 

■■Fx+f  (x)--L- - Ft 

dx 


(6) 


Now  we  introduce  some  notation.  The  Lie  bracket  of  two  vector 
fields /,  g  is  another  vector  field  defined  by: 

(7) 


Rearranging  and  cancelling  terms  in  (6),  and  using  (7)  we  obtain 
/(2)(x)  =  [Fx,*<2)(x)J  (8) 

Equation  (8)  is  called  the  Homological  Equation  [  1  ].  In  (5J,  a 
similar  derivation  is  also  presented.  The  Lie  bracket  operation  in 
the  above  defines  a  mapping 

(9) 


[Fx,  •  1 :  <t><2)(jc) 


■  IFx,4><2)(x)I 


Obviously,  (9)  represents  a  linear  mapping  from  n(n  +  l)/2 

(2) 

dimensional  parameter  space  of  the  coefficients  of  4>  (;r)  to  an 
2 

n  («+  l)/2  dimensional  parameter  space.  The  question  is  whether 

Ax)  in  the  range  of  this  mapping,  i.e.  can  we  always  find  a 

(2) 

*  (x)  that  will  satisfy  (8)?  This  problem  was  first  solved  by 

Poincard  [191-  In  the  following,  we  present  a  slightly  modified 
proof  that  closely  follows  ( 1 ,5J: 


Suppose  F  has  a  full  set  of  linearly  independent 
eigenvectors.  Then  we  can  take  the  right  eigenvectors  of  F  as  a  set 
of  basis  vectors,  and  the  left  eigenvectors  as  a  set  of  coordinates, 
which  are  defined  by 

Fv*  =  X*v*  (10a) 

wiF  =  \iwi  (10b) 

where  v*e  C  "  *  1 ,  w(  e  C  *  x  "  and  X,-,  Xte  C.  We  define  a  basis 
for  n-dimensional  vector  valued  polynomials  of  degree  2  as 
follows: 

4>kjj(x)  =  vk(wjx)(w/x)  (orj,  k  =  1 . n  ;  *  =  1,  ...j.  (1 1) 

Using  this  basis  for  the  polynomials  in  Eqn.  (8),  we  evaluate  the 
Lie  bracket: 

[Fx,**  (*)]  =  (X,.  +  \j  -  X*)**.(x)  (12) 

The  mapping  (9)  is  onto  if  (Xf  +  X;  -  Xt)  *  0  for  all 
j,  k  =  1,  i  =  1,  ...J.  In  the  literature,  this  is  called  the 
resonance  condition.  We  note  that  this  is  only  a  sufficient 
condition.  A  general  proof  for  the  case  when  F  does  not  have  a 
full  set  of  independent  eigenvectors  may  be  found  in  [1], 

The  above  can  easily  be  extended  to  an  arbitrary  degree  of 
linearization  p.  We  present  the  final  form: 

+  •••  + V  (13) 

with  (X(|  +  •  •  •  +  Xj  -  X4)  *  0  the  condition  of  no  resonance. 


3.  Higher  Degree  Approximations  to  Control  Systems 
In  this  section  we  will  seek  a  solution  to  the  problem  of 
linearization  for  control.  Full  state  observability  is  implicitly 


assumed.  Consider  a  nonlinear  system  affine  in  control: 


x=f(x)  +  g(x)u  (14a) 

x(0)  =  x\  (14b) 

where  re  R"  and  u  e  (Rm.  The  system  is  assumed  to  be  at  rest 
at  the  nominal  operating  point  (jr*;  u'  =  0).  Again,  we  will  assume 
x*  =  0.  First,  consider  the  linearization  of  ( 14)  at  x‘: 

x  =  Fx  +  Gu  (15a) 

F  =  ~(0),  G  =  g(0).  (15b) 


We  want  a  coordinate  change  for  (14)  of  the  form  identity  plus 

higher  degree  terms,  such  that  the  resulting  linear  plant  will  agree 

with  (14)  up  to  an  error  of  degree  0(x,u)p+l  (i.e.  terms  of 

0(x)p+ 1  and  0(x,u)p)  where  p  is  the  degree  of  approximation. 

When  p  *  1,  the  first  degree  approximation  (15)  is  obtained. 

Similar  to  the  previous  section,  the  case  for  p  *  2  will  be  derived 

first,  and  the  results  will  be  generalized  to  an  arbitrary  degree  p  by 

induction.  First,  the  functions / and  g  are  expanded  in  a  series: 

Ax)  =/%  +/(2,(x)  +  0(x)3 

=  Fx  +/(2)(x)  +  CKx)3  (16) 

g(x)  =  gW)(x)  +  g(,)  (x)  +  O (x)2 

m  2 

=  C+j  \x)  +  Ou)  (17) 

and  the  nonlinear  system  (14a)  is  rewritten  as 
x=  Fx  +/(2)(x)  +  (C  +  g(l>(x))u  +  0(x,u)3  (18) 

We  assume  a  transformation  similar  to  the  one  proposed  in  Sec.  2: 
(2) 

z  =  x-*  (x)  (19) 

In  addition,  a  new  input  v  is  chosen  as 

v  =  a(2)(x)  +  (I  +  P0)(x))u  (20) 

(2) 

where  a  (x)  is  an  n  x  1  vector  of  second  degree  polynomials, 
and  I  +  P*  \x)  is  an  m  x  m  identity  matrix  plus  first  degree 
terms  with  nonsingular  p(x).  Now  we  want  the  system  to 
become,  in  z  coordinates, 

z  =  Fz  +  Cv  +  0(r,v)3  (21) 

We  take  the  time  derivative  of  (19),  and  introducing  (18),  (19), 
(20)  and  (21)  we  obtain: 

Ax)  =  [Fx,*<2)(x)]  +  Ca(2)(x)  (22a) 

\x)u  =  [Gu,*^(x)l  +  GP^\x)u  V  constant  u.  (22b) 
Because  of  its  similarity  to  the  homological  equations  derived  in 
the  previous  section,  we  call  (22)  the  Generalized  Homological 
Equations.  For  a  detailed  derivation  of  (22),  see  [17).  In  the 

same  reference,  the  above  approximation  is  extended  to  an 
arbitrary  degree  as 

/W W  =  [Fx,*(p)(x)J  +  Ca(p)(x)  (23a) 

g^~'\x)u  =  [<7u,*^>\x)|  +  Cp^'\x)«  V  constant  u.  (23b) 
The  resulting  system  is  accurate  up  to  degree  p: 

z  *  Fz  +  Cv  +  CKz,v)P+1  (24) 

Once  a  higher  degree  linear  approximation  is  obtained,  one 
of  the  important  issues  is  the  stability  of  the  closed  loop  system. 
Thus  one  may  choose,  for  instance,  a  linear  state  feedback  for  the 
approximate  model 


i  =  Fi  +  Gv  (25) 

by  setting  v  =  Kz.  The  gain  matrix  K  is  chosen  such  that  in  the 
closed  loop  the  system  gives  the  desired  performance.  If  we 
assume  that  the  model  has  been  linearized  up  to  second  degree, 
using  the  feedback  v  =  Kz  and  Eqn.  ( 19)  we  evaluate  (20): 

Kx  -  K*{\)  =  aa\x)  +  (l  +  P(%)u  (26) 

and  calculate  the  feedback  u  as: 
u  =  (i  +  P(I)(z))-'(a:x  -  Kt>  2\x)  -  ct(2)(x)) 

=  ATx  -  (  p( '  \x)Kx  +  K<p(2\x)  *  a(2\x) }  +  O(x.u)3.  (27) 

In  the  above,  purpose  of  the  feedback  u  becomes  immediately 
clear.  In  addition  to  the  linear  feedback,  there  are  second  degree 
correction,  terms  (inside  curly  brackets  in  (27)).  While  the 
purpose  of  the  feedback  u  =  Kx  is  to  achieve  stability,  pole 
placement,  etc.  for  the  first  degree  approximation  (15a)  to  get 
x  =  (F  +  GK)x,  (28) 

the  feedback  (27)  cancels  certain  second  degree  terms  to  achieve  a 
second  degree  approximation  (accurate  to  second  degree  compared 
with  a  linear  model)  toward  the  same  feedback  design  goals: 
x  =  (F  +  GK)x  +f(2\x)  +  g(l\x)Kx  -C{p°Wx 

+  *0(2)(x)  +  a(2)(x) }  +  0(x,u)\  (29) 

An  important  feature  of  the  feedback  (27)  and  the  resulting  closed 
loop  system  (29)  is  that  one  need  not  transform  the  state  variables 
into  the  new  coordinates  z  that  was  introduced  for  the  sake  of 
calculations.  Feedback  design  can  be  performed  in  the  natural 
coordinates  in  which  the  system  is  originally  presented.  If  some 
of  the  states  are  not  observable,  one  can  estimate  the  unavailable 
state  variables  by  means  of  an  observer,  and  apply  the  same 
procedure.  For  further  work  on  this  problem,  see  [  1 8  J. 


4.  Analysis  of  the  Linear  Mapping 

In  the  homological  equations  (22)  of  Sec.  3,  the  second  degree 
terms  f^\x)  and  g^\x)u  can  be  cancelled  out  under  certain 
solvability  conditions  by  proper  choice  of  ^\x),  a*\x),  and 
p^'Vx).  When  the  coefficients  of  the  like  terms  in  (22)  are  set 
equal,  a  linear  mapping  is  obtained  as 


f  A*) 

\  Ax) 


p°w 


(30) 


A  simple  count  yields  the  dimensions  of  the  domain  and  the  range: 

n2(n  +1)  mn(n  +  1)  .  2  v  n2(n  +  1)  .  2 

2  +  j  +  fw  n  7  j  —  +  n  m  (31) 

To  analyze  the  mapping,  we  make  a  table  for  the  dimensions: 
Form  =  -L  For  m  =2: 

State  State 


Space  Domain 

Range 

Space 

Domain 

Range 

n-2  11 

10 

n  =  2 

20 

14 

n  =  3  27 

27 

n  =  3 

42 

36 

n  =  4  54 

56 

n  =»  4 

76 

72 

n  =  5 

125 

125 

. 1 

Dimensions  of  the  domain  and  the  range  become  equal  whenever  it 
*  2m  +  1.  However,  this  does  not  imply  that  the  mapping  is  of 
full  rank.  For  example,  when  m  *  1,  n  *  3  the  rank  is  26,  not 
27.  In  general,  for  a  single  input  system,  the  rank  of  the  mapping 
is  always  one  less  than  the  dimension  of  the  domain  for  it  2  3. 


We  will  restrict  our  analysts  to  the  second  degree 
linearization  problem  with  a  single  input  u,  i.e.  m  =  1.  We  will 
start  with  the  analysis  of  the  linear  mapping. 

A  necessary  condition  for  finding  a  coordinate  change- 
feedback  pair  for  a  nonlinear  control  system  is  the  local 
controllability  condition  at  the  nominal  point  For  the  system  (18) 
with  a  scalar  input  this  implies 

rank  [G  FG  ...  F"-IGJ  =  n.  (32) 

On  the  other  hand,  we  define  a  1  x  n  matrix  K  such  that 
...  f0  1  H  i  <  n 

KF1  ~lGJ  =  |  (33) 

11  t  =  it 

Then,  the  following  collection  of  one  forms  is  of  full  rank: 
rank  ATF  . . .  KF”~l )  =  it.  (34) 

(32)  and  (34)  together  imply  that  we  can  define  a  basis  for  the 
second  and  first  degree  monomials  as  follows.  Define  as  a  basis 
v*«F*~l<7  (35a) 

and  a  co-basis 

w(.  =  KF‘ (35b) 
Now  we  define  a  basis  for  second  degree  monomials  as 

4>*yto  *  vk(wix)(wjx)  for  j,  k  -  1 . n  ;  »  =  1,  ...j.  (36) 

and  a  basis  for  first  degree  monomials  as 

♦j(x)  =  v*( wpc)  fort  *  I . it ;  i  =  1,  ...ji.  (37) 

Using  the  definitions  (36)  and  (37)  is  a  great  convenience  for 
calculating  the  Lie  brackets  that  appear  in  the  generalized 
homological  equations  (22).  Calculation  of  (22a)  gives 
[Fx,<>*/x)  ]  = 

^i  +  l  J+  ^  ij+l 


.*  +  1 


1 


1  S  t  £  j  <  it;  1  £  k  <  it 
1  £  i  <  j  -  it;  IStcit 
i  =  j  =  n  ;  1  £  t  <  it 


(38) 


In  the  evaluation  of  (38),  when  t  =  it,  the  expressions  become 
slightly  more  complicated.  However,  transforming  the  control 
system  into  a  Brunovsky  canonical  form  [3]  prior  to  the 
linearization  helps  simplify  the  expressions  [14]. 

Next,  we  calculate  (22b) 

r  0  i,  j  <  it 


[C,$*/x)  ]  =  <«>■ 

U; 

These  two  formulas 


of  the  mapping 
r  ^ 


Ax) 

Ar). 


i  <  j  =  n  (39) 

i  =  j  =  n 

are  used  to  compute  the  kernel  and  co-kernel 


>(2)(x)| 

V'W 


(30) 


and  we  now  obtain  a  set  of  linear  equations  expressed  in  matrix 
form: 


7  7 

In  (40),  L  is  a  constant  coefficient  matrix  of  n  (n+l)/2  +  n  rows 
by  n  (n+ 1)/2  +  n(n+ 1)/2  +  n  columns  that  is  found  from  the  above 
evaluation  of  the  Lie  brackets  of  the  mapping.  In 


the  constant  coefficients  of  their 


corresponding  second  degree  terms  are  stacked  in  a  consistent 
lexicographic  ordering.  For  the  single  input  linearization  problem, 
the  column  rank  of  L  is  (rt2(n+l)/2+n(n+l)/2+n  -1). 

A  solution  to  the  linearization  problem  is  developed  as 

follows.  First,  we  note  that  since  the  mapping  (40)  is  deficient  in 

(2) 

rank  for  n  >  2,  a  control  system  with  nonlinear  terms  /  (x)  and 

g^'\x)«  will  not,  in  general,  have  an  exact  solution  to  yield  a 
second  degree  linearization.  In  fact,  the  Hunt-Su  result  (9J  (or 


Kroner's  extension  of  the  same  to  the  approximate  case  in  (16])  is 
a  test  for  precisely  this  condition.  Consequently,  Eqn.  (40)  will 
not  usually  have  an  exact  solution.  For  a  system  with  n  =3,  m  =1 : 
2/333  -*3  +*2  =  0  (41) 

is  the  condition  for  exact  linearizability  up  to  second  degree.  In 
(41).  /jV represents  the  coefficient  of  an  element  of /<2)  in  the 
basis  <t>*j(x)  for  second  degree  monomials  obtained  when  the 
system  is  in  Brunovsky  canonical  form.  Similarly,  gj  is  the 
coefficient  of  the  corresponding  element  of  g  (*)  in  the  basis  for 
first  degree  monomials.  Eqn.  (41)  is  called  the  co-kernel 
equation. 

When  an  exact  solution  does  not  exist,  it  is  reasonable  to 
seek  an  approximate  solution  which  will  minimize  the  error  in  the 
linearization  with  respect  to  some  norm.  In  order  to  give  a  precise 
meaning  to  this  problem,  first  assume  that  we  have  adequate 
knowledge  about  the  operating  regime  of  the  control  system  and 
the  desired  accuracy  as  determined  by 

p(x,u):  A  probability  density  function;  typically  uniform 
over  some  compact  set,  or  Gaussian. 

Q:  A  sensitivity  matrix,  positive  definite. 


And  define  the  “error” 


We  want  to  choose 


such  that  the  above  error  is 


minimized.  Note  that  this  term  is  in  the  range  of  the  mapping,  i.e. 
it  satisfies  the  generalized  homological  equations 


7{2\x)  =  (Fx,<>(2)(r)l  -t-  Ga~2\x) 


(43a) 


g^l\x)u  =  [G«V^(x)l  +  Gp^(x)u  V  constant  u.  (43b) 


<x(2) 


that  will 


Furthermore,  we  wish  to  choose  the  smallest  I 

U‘7 

achieve  the  above.  Again,  we  choose  positive  definite  matrices  5, 
R  and  minimize 


JJ- 


|$(2J|^  +  |a(2)(x)  +  P(U(x)u  |^p(x,u)dxdu  (44) 


,(•> 


or  one  can  take  a  weighted  combination  of  the  above.  In  fact,  5 

can  be  taken  to  be  equal  to  Q  of  (42),  but  the  choice  is  not  limited 

to  this  case.  We  illustrate  the  minimization  in  Figs.  (1)  and  (2). 
2 

Fig.  1  represents  the  ■ n  ■  ^  a — -  7  n2  dimensional  parameter  space 


for  the  range  of  the  mapping.  The  coefficients  of  the  second 
degree  terms  in  the  control  system  define  a  point  in  this  space. 


denoted  by 


The  range  of  L  is  represented  by  a  straight 


line  going  through  the  origin.  Those  points  in  the  range  space  of 
L  that  exactly  satisfy  (40)  will  lie  on  this  line.  Among  these 

infinitely  many  points  we  want  to  find  the  one  (shown  as 


on  the  figure)  which  will  minimize,  with  respect  to  a  norm  as 

defined  earlier,  the  error  between  the  actual  system  that  is  being 

approximately  linearized  and  a  model  which  is  exactly  linearizabte 

(up  to  degree  2)  by  the  coordinate  change  and  feedback.  Fig.  2 
2 

shows  the  n  («+ 1)/2  +  n(n+l)/2  +  n  dimensional  domain  space  of 
the  mapping ,  and  the  minimization  done  in  the  domain  space. 


The  numerical  solution  to  (40)  is  found  by  linear  algebraic 
methods.  For  illustration  purposes  consider  a  mapping 
/4  :  iR^  — >(RM  (45) 


and  solve 

Ax  =  b.  (46) 

If  the  mapping  is  not  of  full  rank,  it  can  be  expanded  as  follows 


a):x  ~>(ax)s  IrN  +  M  (47) 

here  I  is  the  identity  matrix  of  appropriate  dimension.  The 
apping  (47)  is  always  of  full  column  rank.  Now  we  solve  for 

(48) 

ten  one  can  choose  a  metric  G  on  IRW  *  M 

C, 


;n  one  can  chc 

jc,,  0  -j 
L  0  G22J 

id  find  a  solution  to 

II,/-*  II 


min 

sIR 


N 


2 

'G  - 


le  well-known  solution  of  (50)  is 

=  (l/  at)G  ['  ])~\i  at1g[6°] 


(49) 


(50) 


(51) 


Finally,  we  note  the  following  correspondence  between  the 
dimensions  and  variables  in  (SI)  and  in  the  linearization  problem: 


[V’1 

(2)  . 

*  a 

U(1>J 


N :  n(n+\)f2  4  «(«+ 1)/2  +  « 

■eg 

[5  0  0  -t 
0*0 
0  0  (?-l 


5.  An  Example 

In  this  section  we  linearize  the  following  nonlinear  plant  using  the 
method  outlined  in  Sec.  3.: 

Jt,  =  x2  +  0.5xf  +  jc,tt  (52a) 

Xj  =  x3  -  x,x3  +  x2u  (52b) 

ij  =  u  +  0.5x|+  x2«  (52c) 

Calculation  of  the  coordinate  transformation  and  feedback  gives: 
Z|=x, -x,x3  (53a) 

z2  =  x2  +  0.5x*  -  XjXj  (53b) 

z3  =  x3  +  x,x2  -  x,x3  -x3  (53c) 

v  =  x,x3  +  1.5x2 -XjXj  +  (1  -x,  +x2- Zz3)u.  (54) 

With  the  above,  we  obtain  the  exact  linearization  (implying  that  the 
system  (52)  satisfies  the  Hunt-Su  condition): 
z,  =  z2  (54a) 

i2  =  z3  (54b) 

z3  =  V.  (54c) 

A  simple  feedback  design  v  =  Kz  that  places  the  closed  loop  poles 
at  locations  -1,  -0.707  ±  0.707 /  yields  the  gains  as  k,  =  -1,  kj  = 

-2.4142,  k3  =  -2.4142.  Using  v  =  AT(x  -  <|>(2)(x)),  (54),  and 
(27),  the  feedback  u  is  evaluated.  The  nonlinear  input  is  then 
applied  to  the  system  (52).  Note  that  this  will  introduce  O (x,u)3 
terms  into  (52).  In  Figs.  3  through  8,  we  present  simulation 
results  and  comparisons  for  the  above  linearization  and  feedback 
problem.  In  all  plots,  continuous  lines  represent  the  time  response 
curves  for  a  fictitious  linear  system  equal  to  the  linear  part  of  (52) 
with  feedback  u  =  Kx  applied.  The  higher  order  linearization 
method  of  this  paper  for  the  nonlinear  system  (52)  is  compared 
against  this  exact  linear  model  (dashed  lines).  A  feedback  design 
based  on  a  first  order  approximation  (i.e.  the  feedback  u  =  Kx), 
and  applied  to  the  nonlinear  model  (52)  is  plotted  with  dotted 
lines.  Figs.  3,  4,  and  5  show  the  responses  to  a  step  input  x2  * 
0.4.  In  Figs.  6, 7,  and  8  the  response  curves  for  a  step  input  x2  = 
-0.4  is  shown.  The  simulations  demonstrate  the  advantage  of  the 
proposed  nonlinear  feedback.  The  time  response  of  the  nonlinear 
system  with  nonlinear  controller  is  closer  to  the  response  of  a 
linear  system  (specifically,  the  linear  system  obtained  from  the 
first  order  part  of  the  vector  field)  than  a  control  design  based  on  a 
first  order  approximation.  The  effect  and  the  improvement  of  this 
nonlinear  control  on  the  stability  bounds  of  a  nonlinear  system  is 
being  investigated. 


Conclusion 

In  this  paper,  we  presented  a  method  to  solve  the  approximate 
linearization  problem  of  nonlinear  control  systems.  The  problem 
is  reduced  to  the  solution  of  a  set  of  linear  equations  as  follows: 
First,  the  generalized  homological  equations  are  derived.  By 
introducing  an  appropriate  basis  for  expressing  higher  degree 
monomials  in  the  vector  field,  a  set  of  equations  linear  in  the 
coefficients  of  the  monomials  are  found.  An  exact  solution  to 
these  set  of  equations  is  not  always  possible.  A  least  square 
solution  is  proposed  that  minimizes,  in  a  statistical  sense  as 
defined  above,  the  error  in  the  approximation. 

We  note  that  in  the  equivalent  linear  map,  the  case  when 
the  nonlinear  terms  to  be  cancelled  are  not  in  the  range  of  the 
mapping  exactly  correspond  to  the  violation  of  the  integrability 
conditions  in  the  Hunt-Su  linearization  theorem.  In  other  words, 
the  given  nonlinear  system  in  this  case  is  not  exactly  linearizable. 
In  the  method  developed  here,  we  still  find  a  "partially”  linearizing 
solution  to  this  problem.  The  least  square  solution  minimizes 
precisely  the  error  in  such  an  approximation. 

Especially  for  systems  with  higher  dimensions  and  higher 
degrees  of  approximation,  the  dimension  of  the  system  of  linear 
equations  may  become  extremely  large  and  difficult  to  solve.  A 
computer  program  that  automates  the  solutions  is  under 
development  by  the  authors. 

The  multi-input  case  for  the  generalized  homological 
equations  is  slightly  more  complicated  to  derive.  Research  is 
continuing  for  the  description  and  solution  of  these  equations  in 
the  most  general  input-output  setting,  and  for  an  arbitrary  degree 
of  linearization. 
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Fig.l  The  range  space  of  the  mapping. 


Fig.  2  Domain  space  of  the  mapping 


LEGEND  FOR  FIGURES  3  TO  8: 

( 1 )  : - Fictitious  linear  model  with  linear  feedbackconool. 

(2)  : - Nonlinear  system  with  nonlinear  cotitrol  (based 

on  the  method  of  this  paper) 

(3)  :  .  Nonlinear  system  with  control  based  on  first 

order  approximation  (same  control  as  the  fictitious  linear  model). 


Fig.  3  Time  domain  response  of  state  jc,;  Simulation  1. 


Fig,  4  Time  domain  response  of  state  Simulation  1. 


Fig.  5  Time  domain  response  of  state  Xy  Simulation  1. 


Fig.  6  Time  domain  response  of  state  x.;  Simulation  2. 


»  t 


Nonlinear  Controller  Design  via 

V' 


Approximate  Normal  Forms 


Arthur  J.  Krener 
Institute  of  Theoretical  Dynamics 
and 

Department  of  Mathematics 
University  of  California 
Davis,  CA  95616 


Research  supported  in  part  by  AFOSR-S5-0267 


1 


* 


1.  Introduction.  Over  the  past  several  years  a  group  of  faculty  and  graduate  students  at 
UC  Davis  have  been  developing  a  set  of  tools  for  the  design  of  controllers  and  observers  for 
nonlinear  systems.  Our  approach  has  been  based  on  normal  forms  and  approximate  normal 
forms  for  nonlinear  systems.  When  a  nonlinear  system  admits  a  normal  form  the  design  of 
a  controller  or  observer  is  greatly  simplified  and  standard  linear  design  tools  can  be 
employed.  The  people  that  have  been  involv  >d  in  this  program  are  Mont  Hubbard,  Sinan 
Karahan,  Andrew  Phelps,  Yi  Zhu,  Ruggero  Frezza  and  myself.  This  work  has  been 
supported  in  part  by  AFOSR.  In  this  paper  I'll  give  an  overview  of  our  program. 

2.  Normal  Forms.  Following  Kailath's  terminology,  [10],  there  are  four  normal  forms  for 
linear  systems,  i.e.,  controllable,  observable,  controller  and  observer  form.  The  first  two 
are  relatively  straightforward  to  obtain,  provided  the  system  is  controllable  or  observe ble. 
However,  the  latter  two  are  more  useful  in  the  design  of  stabilizing  state  feedback  control 
laws  and  asymptotic  state  observers.  If  a  linear  system  is  both  controllable  and  observable 
then  it  admits  all  four  normal  forms. 

In  [14]  we  discussed  the  nonlinear  generalizations  of  the  four  linear  normal  forms. 
Unfortunately,  even  controllable  and  observable  nonlinear  systems  do  not  admit  all  four 
nonlinear  normal  forms.  A  nonlinear  system  which  admits  controller  normal  form  is 
sometimes  said  to  be  state  feedback  linearizable  in  the  sense  of  Hunt— Su  [8]  and 
Jakubczyk— Respondek  [9].  For  a  system  in  controller  normal  form,  the  design  of  a 
stabilizing  state  feedback  control  law  is  a  straightforward  task.  However,  most  systems  do 
not  admit  a  controller  normal  form  and  even  when  one  does,  the  transformation  of  a 
system  into  controller  normal  form  involves  solving  a  system  of  first  order  linear  partial 
differential  equations  which  can  be  quite  difficult. 
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Similar  remarks  are  even  more  appropriate  for  observer  normal  form.  For  a  system 
in  observer  form,  the  design  of  an  observer  is  a  straightforward  task.  But  very  few  systems 
admit  such  forms  and  the  computation  of  observer  normal  form  is,  in  general,  extremely 
difficult. 

For  these  reasons,  we  have  introduced  approximate  versions  of  nonlinear  controller 
and  observer  form  [15,  16].  These  may  be  thought  of  as  finding  systems  nearby  to  the 
original  which  admit  controller  or  observer  form.  The  computation  of  such  a  system  is 
relatively  straightforward,  and  reduces  to  solving  a  set  of  linear  equations.  Unfortunately, 
these  linear  equations  are  not  always  solvable  and  they  increase  in  size  quite  rapidly  with 
the  dimension  of  the  system. 

We  start  by  introducing  modified  versions  of  controller  and  observer  normal  forms 
of  the  nonlinear  system. 

(2.1a)  {  =  f(f)  +  g«)u 

(2.1b)  y  =  h(() 

(2.1c)  ((0)  ~  =  0 

around  the  nominal  operating  point  (°,  which  for  convenience  we  assume  to  be  0.  We 
assume  f(0)  =  0  and  h(0)  =  0.  If  this  is  not  the  case,  then  in  many  important  cases  it 
can  be  made  so  by  a  possibly  time  varying  change  of  state  and  output  coordinates.  As  is 
usual,  the  state  (  is  n  dimensional,  the  control  u  is  m  dimensional  and  the  output  y 
is  p  dimensional.  It  is  relatively  straightforward  generalization  to  consider  systems  where 
y  depends  directly  on  u,  as  in 

(2  Id)  y=h(0  +  k(()u, 


3 


* 

however  to  simplify  the  exposition  we  shall  not  do  so. 

We  are  interested  in  studying  (2.1)  under  the  pseudogroup  of  state  coordinate 
transformations  around  £°  =  0-  1°  [14]  we  studied  arbitrary  change  of  coordinates  and 
attempted  to  bring  the  system0  into  normal  form  based  on  prime  systems.  Such  normal 
forms  are  closely  related  to  Brunovsky  form  and  its  dual.  In  this  article  we  shall  restrict 
our  attention  to  changes  of  state  coordinates  x  =  x(£)  whose  Jacobian  at  £°  =  0  is  the 
identity 


|(0)  =  1- 

Such  transformations  have  two  virtues.  The  first  is  that  they  leave  invariant  the 
first  order  linear  approximation  to  (2.1), 


(2.2a) 

z  =  Az  +  Bu  +  0(z,u) 

(2.2b) 

y  =  Cz  +  0(z)2 

(2.2c) 

z(t)  =  £(t) 

where 

(2.3a) 

A  =  f(0) 

(2.3b) 

B  =  g(0) 

(2.3c) 

c  =  $j(0) 

The  second  is  that  the  nonlinear  coordinates  £  and  the  normal  form  coordinates  x 
agree  to  first  order, 
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(2.4a)  £  =  x  +  4>(x) 

where 

(2.4b)  m  =  0,  H  (0)  =  0. 

Typically,  the  original  coordinates  in  which  the  system  is  described  have  some  natural 
meaning  and  the  coordinates  have  different  dimensions,  e.g.,  distance,  velocities,  mass,  etc. 
Property  (2.4)  means  that  at  least  to  first  order  the  normal  form  coordinates  have  the  same 
dimensions  and  intuitive  meanings  as  the  natural  coordinates. 

The  system  (2.1)  admits  a  modified  controller  form  if  there  exists  a  change  of  state 
coordinates  (2.4)  which  transforms  (2.1)  into 

(2.5a)  x  =  Ax  +  Bu  +  B(<*(x)  -f  /?(x)u) 

(2.5b)  y  =  Cx  +  7(x) 

It  follows  from  (2.4)  that  the  nonlinear  terms  are  quadratic  or  higher  in  (x,u),  i.e. 

(2.6a)  o(0)  =  0,  ||  (0)  =  0 

(2.6b)  p(  0)  =  0, 

(2.6c)  7(0)  =  0,  ^  (0)  =  0. 


We  require  that  the  m  x  m  matrix  1  /?(x)  be  invertible  for  x  of  interest. 

These  conditions  (2.4)  and  (2.6)  insure  that  A,  B,  C  are  given  by  (2.3).  Hence  the  linear 
part  of  modified  controller  form  of  (2.1)  is  the  same  as  the  first  order  approximation  (2.2) 
to  (2.1). 
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The  system  (2.1)  admits  a  modified  observer  form  if  there  exists  a  change  of  state 
coordinates  (2.4)  which  transforms  (2.1)  into 

(2.7a)  x  =  Ax  +  Cu  4-  a(y)  +  $(y)u, 

(2.7b)  y  =  y  +  7(y)» 

(2.7c)  y  =  Cx. 

It  follows  from  (2.1)  that  the  nonlinear  terms  are  quadratic  or  higher  in  y,  i.e. 

(2.8a)  a(0)  =  0,  ^  (0)  =  0 

(2.8b)  0(0)  =  0, 

(2.8c)  t(0)  =  0,  &  (0)  =  0. 

We  require  that  the  mapping  (2.7b)  be  invertible  between  the  y  and  y  of  interest. 
Once  again  the  A,  B,  C  of  (2.7)  are  the  same  as  those  of  (2.2,  3,  5). 

Henceforth  we  shall  drop  the  "modified"  and  refer  to  (2.5)  and  (2.7)  as  controller 
and  observer  forms.  Of  course  generally  it  is  not  the  same  change  of  coordinates  taking 
(2.1)  to  (2.5)  and  (2.7)  and  the  a,  0,  7  are  different.  In  particular,  the  dimensions  and 
arguments  of  0,  0,  7  differ  between  (2.5)  and  (2.7).  When  necessary  we  shall  use 
subscripts  c  and  o  to  distinguish  controller  coordinates  x  =  <£  —  0  (x  )  and  functions 

V  Li  V 

a  (x  ),  ^r(xr)’  7r(xr)  fr°m  observer  coordinates  x  =£—  <pAx  )  and  functions 

L'  v  v  L/  L<  v  v  (/  v 

«0(Cx0),  0o(Cxo),  7o(Cx0). 

3.  Poincare  Linearization.  Henri  Poincare  considered  the  problem  of  transforming  a 
nonlinear  vector  field  into  a  linear  vector  field  by  a  change  of  coordinates  around  a  critical 
point.  We  briefly  describe  his  theory,  a  fuller  description  can  be  found  in  Guckenhermer 
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and  Holmes  [6]  and  Arnold  [1]. 

We  are  given  a  single  vector  field 

(3-lc)  \  =  f(0 

(3.1b)  f(0)  =  0 

with  a  critical  point  at  £°  =  0.  We  are  interested  in  finding  a  change  of  coordinates  (2.4) 
which  transforms  (3.1)  into  a  linear  vector  field, 

(3.2)  x  =  Ax 

where  A  is  given  by  (2.3a). 

Poincare  noted  that  one  could  develop  the  change  of  coordinates  term  by  term  in 
homogeneous  powers  of  x.  At  degree  two  we  seek  an  n  dimensional  vector  field  4^\x) 
whose  entries  are  homogeneous  polynomials  of  degree  2  in  x  such  that  under  the  change 
of  coordinates 

(3.3)  (  =  x  +  <^2)(x) 

the  differential  equation  (3.1)  is  transformed  to 

(3.4)  x  =  Ax  +  O(x)3 

whose  O(x)3  denotes  cubic  and  higher  terms  in  x.  Superscripts  in  parentheses  will  be 
used  to  indicate  that  the  function  is  homogeneous  of  the  degree  of  the  superscript  in  its 
arguments.  If  we  expand  f(£)  in  homogeneous  powers  of 
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(3.5)  f(«  =  \(  +  f<2>«)  +  +  •  •  • 

then  (3.1)  is  transformed  into  (3.4)  iff  <p^  (£)  satisfies  the  so  called  homological  equation 
(3.6a)  [  Ax,  0^(x)]  =  f^(x) 


where  [  ,  ]  is  the  Lie— Jacobi  bracket 


It  is  straightforward  to  verify  that  [Ax,  •]  is  a  linear  map  from  homogeneous 
vector  fields  of  degree  2  into  homogeneous  vector  fields  of  degree  2.  Moreover  the 
homogeneous  n  dimensional  vector  fields  of  degree  2  form  a  linear  space  of  dimension 
n2(n+l)/2.  Hence  (3.6a)  is  solvable  for  arbitrary  f^  iff  zero  is  not  an  eigenvalue  of  the 
linear  mapping  defined  by  [Ax,  •].  Poincare  noted  that  the  eigenvalues  of  this  mapping 
are  related  to  the  eigenvalues  of  A  in  a  simple  fashion.  To  see  why,  suppose  A  is 
semisimple,  i.e.,  there  exists  a  basis  v*,...,vn  of  eigenvectors  of  A 


(3.7a)  Av1  =  Aj  v1 

possibly  over  the  complex  numbers. 

Let  Wj,...,wn  be  a  cobasis  of  left  eigenvectors  of  A, 

(3.7b)  WjA  =  Aj  Wj 
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Then  the  space  of  n  vector  fields  homogeneous  of  degree  2  has  as  a  basis 

(3.8)  ^j(x)  =  vk(wjx)  (woe) 

when  1  <  i  <  j  <  n  and  1  <  k  <  n.  A  straightforward  calculation  yields 

[Ax,  j(x)]  =  (Aj  +  A.  -  Afc)  <(>\ j(x). 

Hence  the  eigenvalues  of  [Ax,  •]  on  vector  fields  homogeneous  of  degree  2  are 


(3.9)  Aj  +  Aj  -  Ak 

when  1  <  i  <  n  and  1  <  k  <  n. 


Hence  the  homological  equation  (3.6a)  is  solvable  if  no  expression  of  the  form  (3.9) 
is  zero.  Of  course  this  is  a  sufficient  but  not  necessary  condition  because  a  particular  f^ 
might  well  be  the  range  of  [Ax,  •],  e.g.,  f^2^  =  0. 

If  (3.6a)  is  solvable  one  can  proceed  to  look  for  a  transformation  canceling  the  third 
degree  terms  in  f, 


(3.10) 


£  =  x  + 


and  [Ax,  •]  is  linear  mapping  of  these  vector  fields  homogeneous  of  degree  3  into 
themselves.  The  eigenvalues  of  this  mapping  are 


(3.12) 


A.,  +  ^  +  Ak  -  A, 
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where  l<i<j<k<n,  l<^<n. 

Hence  (3.11)  is  solvable  for  arbitrary  f(3)  iff  none  of  (3.12)  is  zero.  This  generalizes  to 
higher  degree.  If  one  of  (3.9)  or  (3.12)  or  their  generalization  if  zero  then  there  is 
"resonance"  and  linearization  is  not  always  possible.  We  refer  the  reader  to  [1]  and  [6]  for 
more  details. 

4.  Approximate  Controller  Form.  S.  Karahan  in  his  Ph.D.  thesis  [12]  studied  the 
application  of  Poincare's  method  to  finding  controller  forms  and  approximate  controller 
forms.  We  give  a  brief  description  of  his  work. 

One  starts  by  expanding  (2.1)  into  homogeneous  powers  of  (£,  u), 

(4.1a)  £  =  A(  +  Bu  +  f<2)(0  +  g(1)(£)  u  +  ... 

(4.1b)  y  =  C£  +  h(%)  +  ... 

One  seeks  a  change  of  coordinates 

(4.2)  £  =  x  4-  0^(x) 

transforming  (4.1)  into  approximate  controller  form 

(4.3a)  x  —  Ax  +  Bu  +  B(c/2)(x)  +  /^^(x)u)  +  0(x,u)3 

(4.3b)  y  =  Cx  +  7^(x)  +  O(x)3 

Following  Poincare,  we  see  that  this  will  happen  iff 
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(4.4a) 

(4.4b) 


[Ax,  <^“)(x)]  +  B  ar^(x)  =  f^2)(x) 
[Bu,  <^2)(x)]  +  B//^(x)u  =  g^(x)u 


where  (4.4b)  must  hold  for  each  constant  u.  We  refer  to  these  as  the  generalized 
homological  equations.  Like  the  homological  equations,  they  are  linear  equations  but  they 
are  generally  not  square.  The  space  of  unknown  ^2)(x),  a^(x)  and  is 

n2(n+l)/2  +  mn(n+l)/2  +  m2n  dimensional.  The  constraint  space  of  f^2)  and  g^  is 
n  (n+l)/2  +  n  m  space.  These  dimensions  agree  iff  n  =  2m+l.  Generally  the  map 

is  not  of  full  rank  so  it  is  not  always  solvable  even  when 

n  =  2m+l. 

Karahan  has  analyzed  this  mapping  using  a  basis  and  cobasis  related  to  the 
controllability  matrix  (B,  AB,  ...,  An-1B).  We  refer  the  reader  to  [12]  for  details. 

Since  the  system  (4.4a)  is  generally  not  solvable  one  is  forced  to  seek  approximate 
solutions.  One  way  of  doing  so  is  to  find  f^2^  and  g^  in  the  range  of  the  mapping  (4.2) 
which  is  closest  in  some  least  squares  sense  to  the  given  f^  and  g^.  Moreover  one 
would  like  to  choose  the  smallest  <p^\  and  which  maps  into  f^2^  and  g^1^. 
Again  we  refer  the  reader  to  [12]  for  more  details. 

Before  closing  this  section  it  should  be  mentioned  how  an  approximate  controller 
form  (4.3)  can  be  used  to  stabilize  a  nonlinear  system  (2.1)  (or  equivalently  (4.1))  by  state 
feedback.  The  standard  approach  is  to  approximate  the  nonlinear  system  to  first  order  by 
(2.2),  choose  a  stabilizing  feedback  law  for  (2.2),  u  =  Fz,  transform  this  back  into  original 
coordinates, 

(4.5)  u  =  F£. 

Expressed  in  homogeneous  terms  the  closed  loop  dynamics  is 
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(4.6)  |  =  (A  +  BF)  f  +  f<2)«)  +  g(1)(f)  F«  +  0(?)3 

and  hence  the  system  is  locally  stable  around  £°  =  0.  Of  course,  if  it  is  too  far  from 
(°  =  0,  the  quadratic  and  higher  terms  may  drive  it  unstable. 

In  the  normal  form  approach,  we  typically  will  use  the  same  stabilizing  state 
feedback  gain  F  but  to  apply  it  to  the  second  order  linearization  (4.3)  rather  than  the  first 
order  linearization  (2.2).  The  resulting  feedback  is 

(4.7)  u  +  ar^(x)  +  )(x)u  =  Fx 

which  results  in  x  coordinates  the  closed  loop  system 

(4.8)  x  =  (A  +  BF)x  +  O(x)3 

Generally  speaking,  it  is  better  to  implement  the  feedback  in  the  original  f  coordinates 
taking  advantage  of  the  fact  that  the  inverse  to  (4.2)  is 

(4.9)  *  ={-•/»<%) +  0({)3 

Neglecting  higher  than  quadratic  terms  we  obtain  from  (4.7)  the  feedback 

(4.10)  u  =  F*-(F*<2)(0  +  a(2>(?)  +  /tl)(()  F()  +  Otf)3. 

Note  that  to  first  degree  the  standard  feedback  (4.5)  and  the  feedback  (4.9)  agree. 
However,  the  second  degree  terms  of  (4.10)  cancel  the  second  degree  terms  of  (4.6)  to 
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obtain  in  x  coordinates  (4.8).  One  expects  that  (4.10)  is  asymptotically  stabilizing  over  a 
larger  neighborhood  of  £°  then  (4.5). 

Of  course  one  can  also  seek  a  higher  degree  approximate  controller  form.  The 
dimensions  of  the  homological  and  generalized  homological  equations  grow  exponentially  in 
the  degree  of  the  approximation.  Hence  this  approach  may  not  be  recommended.  It  might 
be  more  efficient  and  effective  to  find  approximate  controller  forms  of  degree  two  around 
several  operating  points  rather  than  an  approximate  controller  form  of  degree  three  around 
a  single  point. 

5.  Approximate  Observer  Form.  The  work  I'm  about  to  describe  is  joint  with  Andrew 
Phelps.  We  seek  a  change  of  coordinates  of  the  form  (4.2)  which  transforms  (4.1)  into 
approximate  observer  form 

(5.1a)  x  =  Ax  -f  Bu  +  +  //^(y)u  +  0  (x,u)3 

(5.1b)  y  =  Cx  +  7(2)(y)  +  O(x)3 

(5.1c)  y  =  Cx. 

As  before  this  is  possible  iff  we  can  solve  another  set  of  generalized  homological 
equations 

(5.2a)  [Ax,  <p^\x)}  4-  a^(Cx)  =  f^(x) 

(5.2b)  [Bu,  ^2^(x)]  /?(*)(Cx)u  =  g^(x)u 

(5.2c)  7(2)(Cx)  -  C<^2)(x)  =  h(2)(x) 

As  before  (5.2b)  must  hold  for  each  constant  u. 
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(2) 

These  equations  are  a  linear  mapping  from  the  space  of  functions  ;(x), 

</2)(Cx),  /^(Cx),  7^2^(Cx)  to  the  space  of  functions  f^2^(x),  g^(x),  h^2^(x).  The 

o  o 

dimension  of  the  domain  is  n“(n+l)/2  +  np(p+l)/2  +  m  2p  4-  p  (p+l)/2  and  that  of  the 
range  is  n2(n+l)/2  +  mn(n+l)/2  +  pn(n+l)/2.  In  general,  these  equations  (5.2)  are  r.ot 
solvable  so  as  before  one  must  seek  a  least  squares  solution.  We  shall  report  on  that  in 
more  detail  at  another  time. 

If  (2.1)  (equivalently  (4.1))  can  be  transformed  to  approximate  observer  form  then 
it  is  easy  to  construct  an  observer.  We  choose  H  so  that  A  +  HC  is  sufficiently  stable. 
An  approximation  x(t)  to  x(t)  is  defined  to  evolve  according  to 

(5.3)  x  =  (A  +  HC)x  +  Bu-H(y-7(2)(y)) 

+  a^2\y-^2\y))  +  (fi2\ y-^2\y))n 

then  the  error  x(t)  =  x(t)  —  x(t) 
satisfies 


(5.4)  x  =  (A  +  IIC)  x  +  0(x,  x,  u)3 

Hence  if  the  initial  error  is  not  too  large  and  u  is  also  not  too  large,  we  can  expect 
x(t)  -*  0  as  t-*oo. 

Of  course,  it  is  preferable  to  implement  the  observer  in  natural  coordinates  so  we 
transform  (5.3)  using  £  =  x  +  to  obtain 
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(5.5a) 


t  =  A£  T  Bu  +  H(y  -  y)  4-  f^(£)  +  g(1)(£)  u 
+  a^(y)  -  a^2\y)  +  (l^2\y)  -  (fi2\y))u 

+  H(7^(y)  -  7^(y))  +  (0  H(y  -  y) 

dS 

+  ou,  i  u)3 

(5.5b)  y  =  C£  +  h(2)(0 

(5.5c)  £(0)  =  £  =  0. 

Notice  that  the  linear  part  of  (5.5)  is  the  observer  for  (2.1)  one  would  obtain  from  the 
linear  approximation  (2.2),  namely 


(5.6a)  z  =  Az  +  Bu  +  H(y  —  y) 

(5.6b)  y  =  Cz 

(5.6c)  z(0)  =  £°  =  0. 

The  error  z  =  £  —  z  between  (2.1)  and  (5.6)  satisfies 


(5.7a)  z  =  (A  +  HC)  z  +  0(&  i  u)2 

while  the  error  of  the  observer  (5.5)  expressed  in  x  coordinates  satisfies  (5.4).  Hence  one 
expects  (5.5)  to  perform  better  as  an  observer  for  (2.1)  over  a  larger  operating  range. 

As  with  the  state  feedback  (4.10),  the  second  degree  terms  of  the  observer  (5.5)  are 
a  correction  to  the  standard  linear  observer  for  the  quadratic  nonlinearations  of  the 
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original  system.  In  implementations  one  would  replace  the  state  £  in  the  state  feedback 
control  law  (4.9)  with  the  estimate  £  from  (5.5). 

One  can  continue  this  process  and  look  for  a  third  degree  change  of  coordinates 
which  transforms  the  system  into  approximate  observer  form  where  the  error  terms  are 
0(£,  u)  .  One  obtains  in  this  further  third  order  correction  to  the  state  feedback  (4.10) 
and  observer  (5.5).  Viewed  in  this  light,  we  see  that  the  approximate  normal  form 
approach  allows  us  to  start  with  a  standard  linear  design  based  on  the  linear  approximation 
(2.2)  and  build  in  a  succession  of  higher  degree  corrections  to  overcome  the  nonlineararities 
of  (4.1).  Throughout  we  can  keep  the  same  feedback  gain  K  and  observer  gain  H,  and 
these  can  be  chosen  by  standard  linear  design  techniques  applied  to  the  linear 
approximation  (2.2). 

6.  Coprime  Factorizations.  This  work  is  joint  with  Yi  Zhu  [19].  Suppose  we  have  a 
system  in  controller  normal  form 

(6.1a)  xc  =  Axc  +  Bu  +  B(ac(xc)  +  /?c(xc)u) 

(6.1b)  y  =  Cxc  +  7c(xc) 

(6.1c)  xc(0)  =  0 

where  the  c-subscripts  indicate  coordinates  and  functions  associated  to  controller  normal 
form.  We  view  (6.1)  as  defining  an  input/output  map 

(6.2a)  G:  u(')  i — »  y  (•) 

from  functions  u(t)  to  y(t)  for  t  >  0. 
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We  seek  a  right  factorization  of  G 


G  =  No  M 

where  N  and  M  are  input/output  maps 
(6.2b)  M:  v( •)  i — ♦u(-) 

(6.2c)  N:v(-)' — *y(‘)> 

M  is  invertible  and  o  denotes  composition.  There  is  a  large  and  growing  literature  on 
coprime  factorization  of  both  linear  and  nonlinear  systems.  A  sampling  is  [2-5,  7,  10,  11, 
13,  17,  18,  20-26].  In  particular  our  approach  follows  [3,  4]. 

To  describe  the  input/output  maps  M  and  N  we  shall  use  a  state  space 
realization.  In  particular  we  define  M  to  be  the  input/output  map  of 

(6.3a)  £c  =  (A  +  BF)  £c  +  Bv 

(6.3b)  ocUc)  +  (1  +  /?c(y)u  =  F£c  +  v 

(6.3c)  *c(0)  =  0 

where  (6.3b)  defines  u  as  a  function  of  £c  and  v. 

We  consider  the  composition  N  =  G  o  M,  this  is  realized  by  the  2n  dimensional 
system  (6.1,  3)  described  in  £f,  x£  coordinates.  Let  e  =  xQ-  (c  then 
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(6.4) 


e  =  Ae  +  B(—  Ffc  -  v  +  ac(xc) 

+  (i  +  0c(xc))  (i  +  /yyr1  (nc  +  v  -  ac(g» 

If  e(t)  =  0  then  e(t)  =  0.  Since  e(0)  =  0  we  conclude  that  e(t)  =  0  for  all  t  >  0.  Ii. 
other  words,  the  realization  (6.1,  3)  of  N  is  not  controllable  because  e(t)  is  unaffected  by 
the  input  v(t). 

A  controllable  realization  of  N  is 

(6.5a)  <c  =  (A  +  BF)  (c  +  Bv 

(6.5b)  y  =  CCc+7c(Cc) 

(6.5c)  <c(0)  =  0 

Hence  we  conclude  that  G  =  N  o  M  1  where  N  and  M  axe  realized  by  (6.5)  and  (6.3). 
Notice  that  M  is  invertible  since  (1  +  /?c)  is  invertible  by  assumption. 

Notice  also  that  if  (A,  B)  is  a  controllable  pair  then  we  can  choose  F  so  that  (6.3) 
and  (6.5)  are  stable  systems.  Hence  we  have  factored  G  over  the  ring  of  stable  nonlinear 
systems.  We  are  being  deliberately  vague  about  the  precise  definition  of  a  stable  nonlinear 
system.  It  is  clear  that  (6.3,  5)  are  "stable"  under  any  reasonable  definition. 

Of  course,  we  are  interested  in  coprime  factorizations  over  the  ring  of  stable 
nonlinear  systems.  Again  we  should  not  try  to  make  this  concept  precise  but  following 
Hammer  [7]  and  others  we  shall  say  that  G  =  N  o  M  ^  is  a  coprime  factorization  if  there 
exists  P,  the  input/output  map  of  a  stable  system, 

(6.6a)  P:  (”)  _♦  w 
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such  that  the  composition 


(6.6b)  P  o  (M):  vi — >  (“)  i — .  w 

is  the  identity,  w  =  v. 

The  input/output  map  of  (^)  can  be  realized  by  an  n  dimensional  system 

(6.7a)  £c  =  (A  +  BF)  £c  +  Bv 

(6.7b)  yy  +  (1  +  /yy  )u  =  f^c  +  v 

(6.7c)  y  =  Cyt-  7c(Cc) 

(6.7d)  yo)  =  0 

A  left  inverse  of  (6.7)  is 

(6.8a)  zc  =  Azc  +  Bu  +  B  («c(zc)  +  0c(zQ)  u) 

(6.8b)  w  =  oc(zc)  +  (1  +  /?c(zc))u  -  Fzc 

(6.8c)  zc(0)  =  0 

If  e  =  -  zc  then 

e  =  Ae  +  B(ac(y  -  <*c(zc)  +  (yy  -  0c(zc))u) 

If  e(t)  =  0  the  e(t)  =  0  and  since  e(0)  =  0  it  follows  that  e(t)  =  0  for  all  t  >  0.  If 
e(0  -  yi)  _7-c(t)  =  0  then  w(t)  =  v(t)  so  (6.8)  inverts  (6.7). 


19 


However  we  do  not  know  that  (6.8)  is  stable.  To  insure  the  stability  of  (6.8),  we 
must  add  to  (6.8a)  an  extra  term.  This  term  must  stabilize  (6.8)  and  also  must  be  zero 
when  £c  =  z£  so  that  (6.8)  remains  a  left  inverse  of  (6.7).  How  do  we  find  such  a  term? 

Notice  that  the  dynamics  (6.8a)  is  the  same  as  the  dynamics  of  the  original  system 
(6.1a)  and  notice  that  the  other  output  y  of  (6.7)  does  not  appear  in  (6.8).  Perhaps  we 
can  inject  y  into  (6.8a)  to  stabilize  it?  This  is  more  or  less  equivalent  to  asking  whether 
output  injection  can  be  used  to  stabilize  the  original  system  (6.1).  This  is  always  possible 
for  systems  in  observer  form,  hence  we  assume  that  there  exists  a  change  of  coordinates 

(6.9)  xc  =  xQ  +  <p  (xQ) 

satisfying  (2.4b)  transforming  (6.1)  into  observer  form 

(6.10a)  xQ  =  AxQ  +  Bu  +  %(CxQ)  +  /?0(CxQ)u 
(6.10b)  y  =  CxQ  +  70(CxQ) 

(6.10c)  xQ(0)  =  0 

Suppose  we  consider  a  similar  change  of  coordinates  for  (6.8) 

(6.11)  zc  =  zQ  +  <p(zQ) 

to  obtain 

(6.12a)  zQ  =  Azq  +  Bu  +  «0(CzQ)  +  0Q(  CzQ)u 
(6.12b)  w  =  oo(zQ  +  <p{  zQ))  +  (1  +  Pc(zQ  +  0(zQ)))u 

-  F(*o  + 
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(6.12c) 


z0  (0)  =  0. 

We  add  to  (6.12a)  the  term 
(6.13a)  aQ(y)  -  %(CzQ)  +  (0Q(y)  -  P0(CzQ))  u  +  H(CzQ  -  y) 
to  obtain 

(6.12aa)  zQ  =  (A  +  HC)  zQ  +  Bu  +  aQ(y)  +  PQ  (y)  u  -  Hy 
where  y  is  a  function  of  y  of  (6.7c)  defined  by 
(6.13b)  y  =  y  +  70(y)  =  +  70(C  Q 

and  is  the  state  of  (6.7)  in  observer  coordinates 
(6.13c)  +  <t>(Q 


Notice  that  (6.13a)  is  zero  whenever  =  zQ,  hence  the  input/output  map  P  of 
the  (6.12aa,  b,  c)  is  also  an  inverse  of  (6.7).  Also,  if  (C,A)  is  an  observable  pair  then  we 
can  choose  II  so  that  (6.12aa,  b,  c)  is  stable. 

In  summary,  we  have  shown  that  if  a  nonlinear  system  admits  both  controller  and 
observer  form  than  its  input/output  map  G  can  be  factored  into  the  composition 
N  o  M  1  of  input/output  maps  of  stable  systems  N  and  M.  Moreover  this  composition  is 
coprime  in  the  sense  that  the  input/output  map  (^)  has  a  left  inverse  P  which  is 
realized  by  a  stable  system. 
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We  have  not  presented  this  as  a  theorem  because  we  are  reluctant  at  this  point  in 
time  to  give  formal  definitions  of  coprimeness  and  stability  for  nonlinear  systems.  However 
the  above  development  is  very  analogous  to  the  linear  theory  [3,  4].  See  also  Hammer  [7] 
Unfortunately  the  analogy  is  not  so  straightforward  for  left  copiime  factorizations. 
The  theory  of  left  coprime  factorizations  for  nonlinear  systems  has  some  substantial 
differences  with  the  linear  theory. 

We  start  with  a  system  in  observer  form  (6.10)  realizing  an  input/output  map  G. 
We  define  another  input  output  map 

(6-14)  M:  (y)  i — ►  w, 

by 

(6.15a)  £0  =  (A  +  HCK0  -  Hy  +  aQ<y)  +  0Q(y)  u 

where  y  is  an  invertible  function  of  the  input  y  defined  by 

(6.15b)  y  =  y  +  70(y) 

and  the  output  is 
(6.15c)  w  =  -C£o  +  y 

(6.1 5d)  £o(0)  =  0 
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Consider  the  serial  connection  of  (6.10)  and  (6.15),  this  is  not  a  realization  of  the 
M  o  G  but  it  is  a  realization  of  N  =  M  o  (^).  (This  is  the  first  important  difference  with 
the  linear  theory).  If  we  define  £0  =  xo  —  £0  then  N  is  realized  by 


(6.16a)  C0  =  (A  +  HC)C0  +  Bu 

(6.16b)  w  =  CCQ 

(6.16c)  <o(0)  =  0 

because  in  xQ  coordinates  for  (6.10,  15)  only  the  £o  coordinates  are  observable  from 
the  output  w.  We  consider  N,  M  as  a  left  factorization  of  G,  although  it  is  really  a  left 
factorization  of  (q)  in  the  sense  that 

(6.17)  M.(J)  =  N 

Notice  that  we  cannot  compose  this  on  the  left  with  1  since  M  is  not 
invertible  as  a  mapping  from  (^)  to  w. 

Perhaps  the  best  way  of  viewing  the  situation  is 


(6.18a) 


['  °1 

O 

I  ' 

G 

I 

L  0  M  J 

-  N  • 

or 


(6.18b) 


I 

G 


I  0 
0  M 


-1 


The  matrix  notation  is  somewhat  misleading  because  M  depends  on  both  u  and  v. 


/ 
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In  any  case,  if  (C,  A)  is  an  observable  pair  then  both  (6.15)  and  (6.16)  can  be 
made  stable  by  proper  choice  of  H.  In  particular,  the  nonlinearities  in  (6.15a)  are 
memoryless  functions  of  the  inputs  u  and  y  hence  (6.15)  is  BIBO  stable. 

Next  we  address  the  coprimeness  of  the  above  factorization. 

We  consider  the  input/output  map 

(6.19a)  [—  N,  M]:  (y)  i — »  w 

where  again  the  matrix  notation  is  somewhat  misleading  since  both  u  and  y  are  inputs 
to  M,  i.e., 

(6.19b)  w  =  -  N(u)  +  M(u). 

This  input/output  map  can  be  realized  by  an  n  dimensional  system 
(6.20a)  £0  =  (A  +  HCKo  +  Bu  +  o(7)  +  /?D(y)u  -  Hy 

where  y  is  an  invertible  function  of  the  input  y  defined  by 

(6.20b)  y  =  y  +  70(y) 

and  the  output  w  is  given  by 
(6.20c)  w  =  -  C£q  +  y 
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We  wish  to  find  a  input/output  map  P  realized  by  a  stable  system  so  that  P  is  a 
right  inverse  of  [—  N,  M], 

(6.21a)  P:»— (J) 

(6.21b)  [-  N,  M]  o  P:  v  i — ►  w  =  v. 

We  start  by  constructing  an  inverse  for  (6.20), 


(6.22a) 

zQ  =  AzQ  -  Hv  +  Bu  +  a0(jO  +  0o(y)u 

(6.22b) 

y  =  Cz0  +  v 

(6.22c) 

y  =  y  +  70(y) 

(6.22d) 

u  =  ? 

(6.22e) 

zo(0)  =  0 

We  leave  unspecified  for  the  moment  the  output  u  which  also  appears  in  the  dynamics 
(6.22a).  Notice  that  if  e  =  £Q  —  zQ  is  the  error  between  the  states  of  (6.20)  and  (6.22) 
then  e  =  0  whenever  e  =  0.  Since  e(0)  =  0  we  conclude  that  e(t)  =  0  for  all 
t  >  0  and  so  by  (6.20c)  and  (6.22b)  we  have  w(t)  =  v(t).  In  other  words  (6.22)  is  a  right 
inverse  of  (6.20). 

What  about  the  stability  of  (6.22)?  We  would  like  to  choose  the  output  u  in  such 
a  way  that  (6.22a)  is  stable  in  some  sense.  If  we  ignore  the  -Hv  term  of  (6.22a)  this  looks 
like  the  original  system  is  observer  form.  This  is  not  exactly  true  because  y  is  defined  by 
(6.22b)  with  v  present.  Suppose  the  original  system  can  be  transformed  into  controller 
form  (6.1)  by  a  change  of  coordinates  (6.9).  If  we  apply  a  similar  change  of  coordinates 
(6. 1 1 )  to  (6.22)  we  obtain 
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(6.23a) 


zc  =  Azc  +  Bu  +  B(^zc)  +  ”  Hv 

- 1-  (Hv)  +  (1  +  If-)  K(Cz0  +  v)  -  a„(Cz0) 
o  o 

+  (0o(Czo  +  V)  -  /?0(Czo))u) 

Suppose  we  choose  an  F  such  that  (A  +  BF)  is  stable  and  define  u  by 
(6.22dd)  oc(zc)  +  Pc{zQ)  u  =  Fz£. 

When  the  input  v  =  0,  (6.23a)  becomes 
(6.23b)  zc  =  (A  +  BF)  zc. 

Unfortunately  we  cannot  conclude  that  (6.23a)  is  BIBO  stable  since  the  input  v  is 
multiplied  by  a  function  of  the  state. 

We  conclude  by  noting  that  a  "nonlinear  Bezout  identity"  holds  for  the  above.  In 
other  words  beside  P  being  a  left  inverse  (6.6b)  for  (^)  and  P  a  right  inverse  (6.21b) 
for  [—  N,  M],  it  is  also  true  that 

(6.24a)  [-  N,  M]  o  (“);  v  .  (“)  — ♦  w  =  0 


and 


(6.24b)  P  o  P:  v  i — ♦(J)» — ►  w  =  0 

In  abuse  of  notation  we  summarize  these  equations  by 
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(6.25) 


r  P 

J-N,  M] 


I  O' 
0  I 


The  verification  of  (6.24)  is  straightforward. 

From  the  work  of  Doyle  [3]  Francis  [4]  and  others  the,  the  existence  of  a  nonlinear 
Bezout  identity  suggests  that  it  might  be  possible  to  develop  a  nonlinear  version  of  Youla's 
Q  parameterization  of  all  stable  and  stabilizing  controller  of  a  linear  system.  This 
generalization  would  apply  to  those  nonlinear  systems  which  admit  both  controller  and 
observer  form.  This  class  is  very  thin,  but  perhaps  such  a  result  could  be  extended 
approximately  to  those  systems  that  approximately  admit  controller  and  observer  form. 
Work  in  these  areas  is  continuing. 

7.  Concluding  Remarks.  We  have  briefly  described  an  approach  to  nonlinear  compensator 
design  based  on  nonlinear  normal  forms  and  approximately  normal  forms.  This  approach  is 
being  pursued  by  a  group  of  researchers  at  U.C.  Davis  with  support  from  AFOSR.  The 
principle  advantage  of  the  normal  forms  approach  is  that  to  a  large  extent  it  reduces 
problems  in  nonlinear  design  to  problems  in  linear  design.  We  are  developing  software 
tools  which  utilize  this  approach  as  a  compliment  to  existing  linear  design  software  so  that 
these  linear  design  packages  can  be  used  for  nonlinear  systems  that  admit  at  least 
approximate  normal  forms. 
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...  A  reciprocal  diffusion  is  a 'thi*  ned' diffusion  in  the  following 
intuitive  sense:  view  the  paths  of  a  Markov  diffusion  on  [0,1]  as  the  shoots 
i  i  dense  thicket  of  highly  contorted  shoots,  all  cut  to  a  unit  height  (height 
is  the  independent  parameter).  A  reciprocal  diffusion  is  a  diffusion  thicket 
that  has  been  thinned  out  just  sufficiently  so  that  the  tops  and  bases  of  the 
shoots  conform  to  an  artificially  imposed  joint  distribution.  In  particular  a 
pinned  diffusion  is  a  reciprocal  diffusion  thinned  so  that  the  shoots  start 
and  end  at  single  points  (a  pointed  'bush').  Much  of  the  structure  of  the 
original  distribution  of  the  diffusion  will  be  present  in  all  its  thinnings.  The 
aim  of  this  paper  is  to  give  a  local  characterization  of  this  common 
'reciprocal'  structure. 

Krener  (1988a)  has  made  a  fundamental  study  of  the  infinitesimal 
properties  of  reciprocal  diffusions.  The  problem  of  local  characterization  is 
one  of  a  number  of  intriguing  questions  that  were  only  partially  answered  in 
his  paper,  ft,  turns  out  that  the  reciprocal  structure  of  a  Gaussian 
reciprocal  diffusion  has  a  beautifully  simple  characterization  in  terms  of  a 
linear  self-adjoint  second  order  differential  operator  (Frezza,  Krener,  Levy, 
19S9).  Our  concern  is  with  the  nonGaussian  case,  which  has  to  be  treated 
by  different  methods. 

To  proceed  it  is  necessary  to  introduce  some  notation.  Let  D  be 
th^  space  of  IKn— valued  continuous  functions  on  [0,1],  x  (w)  the  coordinate 

map  u{\ )  and  ,5r  the  Hotel  e-field  generated  by  the  topology  of  uniform 
convei pence.  Suppose  X  =  ({x  :  0  <  l  <  1 },  <>,  O')  is  a  Maikov  diffusion 
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with  continuous  strictly  positive  initial  and  transition  probability  densities 
Pq(x)  and  pgx(t,y),  0  <  s  <  t  <  1.  It  is  a  straightforward  matter  to  show 

that  the  probability  law  IP  of  x  has  a  unique  factorization  into  the  initial 
distribution  Pg(x)  dx  the  final  conditional  distribution  Pgx(l,y)  dy  and  a 

probability  kernel  (x,y)  IPq^  on  narrowly  continuous  as  a  function 

of  (x,y),  that  is  the  law  of  X  conditional  on  Xq  =  x,  x^  =  y. 

Now  suppose  k(x,y)  is  a  strictly  positive  continuous  probability 
density  on  IRn  x  IRn  and  let  be  the  modified  law  on  3f\  A): 

=  k(x,y)  dxdy.  Then  we  shall  call  the  process  Y  =  ({x^},  <J), 

with  law  Q  on  (ft,  &),  a  reciprocal  diffusion  governed  by  a  Markov 
diffusion  X.  If  the  marginal  density  k  is  replaced  by  a  distribution 
rr(dx,  dy)  we  could  create  other  reciprocal  diffusions  f  m  X  with  laws 
singular  to  IP;  for  instance,  the  pi nnings  of  X  at  t  =  0  and  1,  but  to 
keep  the  development  simple  we  restrict  our  attention  to  governed 
reciprocal  diffusions.  Markov  diffusions  are  trivially  reciprocal  and  two 
diffusions  governing  the  same  reciprocal  diffusion  govern  each  other;  in  fact 
it  follows  from  Theorem  3.1  of  (Jamison,  1974)  that  the  ratio  of  their 
end-point  densities  must  take  the  form  hg(x)  h^(y)  for  some  continuous 

positive  functions  h^  and  h^.  Reciprocal  diffusions,  on  the  other  hand, 

are  not  generally  Markov.  For  instance,  if  Y  has  the  law  of  (Z  +  B.: 

V 

0  <  t  <  1},  where  (BJ  is  a  Brownian  bridge  with  Bq  --  B^  =  0,  and  Z 

is  an  independent  nondegenerate  random  variable,  then  Y  is  a  reciprocal 
diffusion  (governed  by  a  Brownian  motion)  that  is  not  Markov.  However, 
reciprocal  diffusions  inherit  many  of  the  properties  of  Markov  diffusions. 

For  0  <  r  <  t.  <  1 ,  let  J *r  denote  the  interior  a— field  o  (x^:  r  <  s  <  t) 

and  the  cxtenor  e-field  o  (x  :  0  <  s  <  r  or  t  <  s  <  1 )  in  3r.  Then  a 

reciprocal  diffusion  Y  is  a  Markov  field  on  the  line  in  the  sense  that  for 
any  0  <  r  <  t  <  1 ,  conditioned  on  x  and  x,,  elements  of  .Z1  are 

independent  of  elements  of  #j'.  Also  for  any  0  <  s  <  1  the  pinned  process 

^  U-X|}-Q('  I  Ns  N))  is  a  Mai  kov  diffusion.  Thofust  prop'Mt  v  is  I  h<’ 
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definition  of  the  reciprocal  process  introduced  by  Bernstein  (1932)  to  give  a 
probabilistic  interpretation  of  Schrodinger's  equation;  the  second  the 
definition  of  a  reciprocal  diffusion  in  the  sense  of  Krener  (1988b).  A 
detailed  study  of  these  and  other  properties  can  be  found  in  Jamison  (1974, 
75).  A  reciprocal  process  possesses  a  'governing'  Markov  process  provided 
some  technical  'end— point'  conditions  are  satisfied.  The  writing  of  a 
governing  diffusion  into  the  definition  adopted  here  simply  allows  us  to 
avoid  these  technicalities. 

The  kernel  (x,y)  -*  P*^  can  be  thought  of  as  the  reciprocal  structure 

that  is  common  to  the  Markov  diffusion  X  and  all  its  subservient 
reciprocal  diffusions.  Similarly  for  s  <  t  the  unique  narrowly  continuous 
probability  kernel  on  ^  given  by  P*^(A)  =  P(A|x =  x,  Xp=  y) 

b  SX  b 

represents  the  common  reciprocal  structure  on  the  interval  [s,t]. 

We  can  now  define  the  notion  of  a  set  of  local  reciprocal 
characteristics.  Let  dt  denote  the  set  of  all  probability  laws  of  reciprocal 
diffusions.  We  shall  say  that  an  assignment  of  a  continuous  function 
py (•,•)  on  [0,1]  x  R  to  every  law  Q  in  Si  is  a  reciprocal  invariant  on 
($,£,]  if  the  restriction  of  p^(u,-)  to  the  interval  s<u<t  is  the  same  for 
all  Q  with  a  common  reciprocal  structure  on  [s,t]  and  that  p"  is  a  local 
reciprocal  invariant  if  it  is  a  reciprocal  invariant  on  [s,t]  for  all 
0  <  s  <  t  <  1.  Finally  we  shall  say  that  a  family  {pj,  p9,  •  •  • ,  pn}  of  local 

reciprocal  invariants  is  a  set  of  local  reciprocal  characteristics  if  it  uniquely 
determines  the  reciprocal  structure  on  [0,1];  that  is,  given  a  set  of 
functions  {r ^ ,  r^,  •  •  •,  rn)  within  the  joint  range  of  {pp  •  •  • ,  pn)  those 

laws  in  the  inverse  image  (Q:  p^  =  r- ,  i  =  1,  •  •  •,  n}  possess  a  common 

reciprocal  structure  Pq  . 

The  next  section  presents  a  theorem  showing  how  a  set  of  local 
reciprocal  characteristics  for  a  reciprocal  diffusion  can  be  constructed  by  a 
differential  transformation  of  its  scmimartingalc  local  characteristics. 

The  final  section  presents  tw°  formulae  that  give  a  probabilistic 
interpretation  of  local  reciprocal  characteristics.  The  first  is  a  curiously 
factored  form  of  (Kirsanov's  formula  t  hat  is  applicable  to  pinned  diffusions 

The  second  requires  some  preliminary  explanation  Kroner  (  1  ‘ ) S S 1  > )  - 
ha  mi  reduced  ,i  numbei  o!  i  i  >  l  ei  j  e| ;  1 1 « •< )  1-eliei  ljl,(«  j  »<  »j  n  1 ,  1 1  <  •  .  ih.it  an- 
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satisfied  by  reciprocal  diffusions  and  that  display  the  infinitesimal  form  of 
particular  conditional  amounts  of  their  first  and  second  differences.  The 
diffusion  matrix  a(t,x)  (that  is,  the  rate  of  change  of  the  matrix  quadratic 
variation  process)  plays  a  central  role.  One  important  postulate  states  that 
for  constant  x,  v  (  IRn,  the  limit* 

limM  EK+i,  -  2xi  +  Vh  I  xt+h  = x  +  vh-  Vh  = x  -  vhl 

=  f(t,x)  +  g(t,x)v  (1) 

for  some  vector  and  matrix  functions  f  and  g  (when  expressed  in 
Riemann  normal  coordinates  with  respect  to  the  Riemann  tensor  a  ). 
Another  is  the  corresponding  expression  (l1)  for  the  (matrix)  second 
moment  of  the  second  difference;  here  the  right  member  is  2  a(t,x). 

Our  second  formula  (Theorem  3)  can  be  regarded  as  an  integrated 
non— asymptotic  variation  of  (1)  for  constant  diffusion  matrices  a. 
Together  with  the  matrix  a  its  coefficients  form  a  set  of  local  reciprocal 
characteristics.  So,  at  least  in  this  integrated  form,  Krener's  postulate  (1), 
together  with  the  matrix  a,  characterize  the  reciprocal  structure.  The 
general  case  with  non-constant  a  will  be  treated  elsewhere. 

My  thanks  are  due  to  A.J.  Krener  who  besides  offering  me 
hospitality  and  support  at  U.C.  Davis,  largely  provided  the  inspiration  for 
this  work,  and  to  B.C.  Levy  and  R.  Frezza  for  educating  me  in  the  ways  of 
Gaussian  reciprocal  processes. 


In  (1988a)  Krener  uses  a  slightly  different  conditioning  in  his  postulates: 
E! '  I  Vi,  +  Vi.  =  2xl-  T  his  introduces  quantities  sucli  as  Nelson's 

current  velocity  that  are  not  generally  reciprocal  invariants. 
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A  LOCAL  RECIPROCAL  CHARACTERIZATION. 

From  now  on  we  use  the  summation  convention.  Using  the  notation 
of  the  previous  section,  suppose  x  =  ({x^.}:  0  <  t  <  1}  ft,  P)  (or  ({xt}, 

IP)  for  short)  is  a  Markov  diffusion  in  Rn  with  diffusion  matrix  a(t,x)  and 
drift  vector  b(t,x).  For  simplicity  we  assume  that  the  components  of  a 
and  b  are  C^,  (that  is,  they  are  bounded  and  have  bounded  derivatives  of 

all  order)  and  that  a  is  uniformly  elliptic.  Then  x  possesses  a  strictly 
positive  transition  density  psx(t,y).  It  is  also  a  continuous  semi  martingale 

on  { oTj}  and  has  the  decomposition 
dxt  =  b(t,  xt)  dt  +  dNf 

where  {N^}  is  a  continuous  martingale  on  {<7q}  diffusing  at  the  rate 
a(t,x  );  that  is,  its  (matrix)  quadratic  variation  is  of  the  form 

L 

ft 

a(s,  x  )ds. 

J0  s 

Now  suppose  Y  =  ({xj,  <))  is  a  reciprocal  diffusion  governed  by  X. 
Suppose  k  is  its  positive  continuous  end-point  density,  and  k^  its  initial 
density.  Let  h  ,(l,z)  be  the  conditional  end-point  density  relative  to 
P0x(l,z)  given  by 

hx(l,z)  =  k(x,z)/(kQ(x)  P0x(l,z)) 

then  h  ,(!,*)  is  positive  and  continuous  and  possesses  a  'space-time 
harmonic  extension'  with  respect  to  x  given  by: 

!ix(t,y)  =  \.(l,z)  pty(l,z)  dz,  for  0  <  t  <  1. 

Let  D  denote  ()/(h  and  I);  denote  Hy  lto's  formula  li x 

:  ;ii  isfu’S  I  foi  fixed  x.  wil  li  1 1n-  summation  convoni.ioit  in  foi  < ■<■) 
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DOhx  +  biDihx+aijDiDjhx  =  0 

on  [0,1]  xlRn.  Then  Mt:  =  hx  (t,  xfc)  is  the  martingale  Ejp[dQ/dlP|  <7q] 
and  has  the  representation 

dM*  =  M*  .  5^  D.(tag  h  (t,  xt))  dNt. 


A  'Girsanov'  argument  then  leads  to  the  representation:  On  (fl,  Q), 


dxt  =  cx  (t,  xfc)  dt  +  df?t, 


where  c1  (t,y)  =  b^tjy)  +  a1-^  D.  log  h  (t,y)  and  where  N  is  a  continuous 
x  j  x  t 

martingale  on  {<7q},  again  diffusing  at  rate  a(t,  xt).  The  full  details  of 

the  argument  can  be  found  in  Theorem  2  (and  its  proof)  in  Jamison  (1975). 
Notice  that  c*(-,-)  is  continuous  and  uniquely  defined.  The  pair  a  and 

c  can  be  thought  of  as  the  local  semimartingale  characteristics  of  Y  with 

respect  to  the  filtration  {  ,/q}.  We  can  obtain  a  different  characterization 

by  conditioning  at  t  —  1;  that  is,  with  respect  to  the  augmented  filtration 
{#[},  then 


dx^  =  ex  (t,  xjdt  +  d(martingale) 

where  the  martingale  (on  {£*})  still  diffuses  at  rate  a(t,  x^)  and  ez 
takes  the  form:  for  0  <  t  <  1 

°^(b,y)  =  b'(t,y)  +  a'j(t,y)  D.  log  p(v(l,z) 

1  he  argument  is  almost  the  same,  though  now  e  |i..v)  becomes  .singular  at 


1 
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Similar  but  more  complicated  characterizations  can  be  obtained  for 
conditionings  at  intermediate  times.  Now  let  a  be  the  inverse  of  the 

matrix  a.  Introduce  the  differential  operator 

Re(a,b)j  =  b^)  +  ^  <*jk  bk  +  a^k  Dj(«k^  ^  * '  •>  n> 

mapping  a  C00  matrix— vector  pair  into  a  C00  vector  field.  We  can  now 
make  the  following  statement. 

Theorem  1  Let  X  be  a  Markov  diffusion  with  a  continuous  positive  initial 
density,  a  diffusion  matrix  a(t,y)  and  drift  vector  b(t,y).  Let  Y  be  a 
governed  reciprocal  diffusion  with  semimartingale  characteristics  a(t,y) 
and  cx(t,y).  We  assume  that  the  components  of  a,  b,  a  and  c  are 

and  that  a  and  a  are  uniformly  elliptic.  Let  0  <  s  <  u  <  1.  Then  X 
and  Y  possess  the  same  reciprocal  structure  on  [s,u]  if  and  only  if  the 
following  functions  of  (t,y)  are  equal  on  [s,u]  x  !Rn 

i)  a  =  a 

ii)  For  all  x  c  IRn,  i,  j  =  1,  2,  •  •  •  n. 

Dj(0jk  bk)  -  Dj(a|k  bk)  =  Di(ajk  ck)  -  Dj(3jk  ck) 

iii)  For  all  x  dRni  =  1,2,  •••  n. 

Re(a,b)j  =  Re(a,  cx)s 


Remark  1)  This  result  is  close  to  being  a  corollary  of  a  remarkable 
short-time  asymptotic  expansion  of  Krener  (19SSb)  for  the  reciprocal 
transition  density  (P(x^  (  dy|xt_k  =  x,  xt+k  =  z)  of  a  Markov  diffusion 

that  parallels  the  corresponding  expansions  of  Azencott  (19S1)  and 
Molchanov  (1975)  for  Markov  transition  densities.  However,  the  expansion 
is  quite  complicated  and  I  have  not  used  it  in  the  proof  as  the  following 
argument  is  more  direct. 

2)  The  theorem  also  holds  if  the  c  is  replaced  by  the  augmented 
semi i nar tiugale  characteristics  e  (l.,v)  provided  all  tin*  fum  ;  ions  aie 


s 


restricted  to  0  <  t  <  1. 

3)  For  simplicity  let  /£  be  the  function  for  i  =  1,  — ,  n 

and  let  P q  be  the  function  —  l/2(b^  b^  +  a^  b^)).  Then  the 

left  member  of  ii)  is  D-/3j  —  Dj/?-  and  that  of  iii)  is  Dq/L  —  Dj/?q.  Then  the 

theorem  states  that  the  n(n  4-  l)/2  functions  —  Dj/3-,  i,  j  =  0,  1,  •  •  •, 

n,  i  <  j,  and  the  n(n  +  l)/2  components  of  the  matrix  a  (or  more 
pedantically,  their  assignments  to  the  probability  law  of  X)  form  a  set  of 
local  reciprocal  characteristics.  Clearly  these  characteristics  are  not 
independent;  in  fact,  in  differential— geometric  terms,  the  D-/?-  —  D-/L  are 

the  components  of  the  exterior  derivative  of  the  1— form  (/3q,  /?p  •  •  • ,  Pn). 


Proof  of  the  'only  if'  part.  Without  loss  of  generality  we  may  take  s  =  0, 
u  =  1.  Assume  that  X  and  Y  possess  the  same  kernel  Pq^  on  S? .  Then 

Y  is  governed  by  X  and  by  the  previous  argument  it  follows  that,  for 
some  positive  continuous  X— space  time  harmonic  function  hx(t,y), 

a(t,y)  =  a(t,y)  and 

c*(t,y)  =  b'(t,y)  +  a^(t,y)  D-  log  hx(t,y) 


Let  /?j,  i  =  0,  1,  •  •  *,  n  be  the  terms  introduced  in  Remark  3  and  let  /L  be 
the  corresponding  terms  (note  a  =  a). 


\  =  °ij  ci '  =  ’■  -  -  A  =  -  M  “jk  4  + a)K  uf\c  <» 


Jk 


(From  now  on  we  generally  omit  the  subscript  x).  Then  /?•  —  /£  =  D- 

log  hx  for  i  =  1,  •  •  •  n.  If  we  can  show  that  in  addition  —  P^  — 

log  h ^  then  —  P ^  is  a  gradient  vector  and,  by  Remark  3,  ii)  and  iii)  will 

follow  immediatclv.  From  ( 1 )  (.  =  lor?  h  satisfies 

^  x 


9 


D0<  =  -b'D^t  j  +  Djfly ) 

=  -  »ij  »Pi  ~  P,)  -  j  »ij(Di(Sj  -  0j)  +  (P,  -  Pj)(Pj  ~  Pj» 

=  -5»iilVi-Vj  +  DiC3j-^j)) 

=  0o-4>> 

So  /?  —  0  is  a  gradient  on  [0,1]  x  Kn  and  the  proof  is  complete. 

The  'if  part.  Assume  the  identities  hold  for  the  pairs  (a,b)  and  (a,  c  ). 

To  show  that  X  and  Y  have  the  same  reciprocal  structure  it  suffices  to 
show  that  (1)  holds  for  some  X-space-time  harmonic  function  hx(t,y) 

with  hx(0,x)  =  1.  Now  for  each  x  the  vector  function 

0x  i^y)  ~  P\(^y))  is  by  ii)  and  iii)  the  gradient  of  some  smooth  function 

<p  on  [0,1]  x  IRn.  Then  it  follows  from  a  reversal  of  the  derivation  above 
and  the  definition  of  /?x  Q  and  /?Q  that 

D0  +  *>'  Di  +  k  +  DADA)  =  "• 

Set  hx(t,y)  =  exp[^»x(t,y)  —  <£x(0,x)].  Then  hx  is  a  positive  smooth 
X-space— time  harmonic  function  with  the  properties  required. 

PROBABILISTIC  INTERPRETATIONS 
The  previous  theorem  is  purely  analytical  in  nature  and  sheds  no 
light  on  the  probabilistic  eaning  of  local  reciprocal  characteristics.  In  this 
section  we  give  two  probabilistic  formulae  in  which  they  occur  naturally. 
They  are  formulated  for  Markov  diffusions.  Theorem  1  makes  it  clear  how 
they  can  be  extended  to  non— Markovian  reciprocal  diffusions. 

Suppase  X  =  ({x^},  O’)  with  diffusion  a  and  drift  b  is  the  Markov 

diffusion  satisfying  the  conditions  of  Theorem  i.  Let  X  =  ({x(}.  IP)  be  the 

Markov  diffusion  with  diffusion  a  and  zero  drift.  Pqx(C.v).  p^  (t,v) 

i lie  corresponding  transition  <l<‘nsii i<>s.  Lot  ll'„  II'! 1  IF'  *n<|  |J’lz 

Ox'  0\'  Ox  0\ 


arc 
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respectively  denote  narrowly  continuous  versions  of  P(-  |  Xq  =  x), 

P(.  |  x0  =  x,  Xj  =  z),  etc.  Pq,  /?j,  •  •  •  Pn  are  defined  as  before.  The  first 

1  z 

formula  is  a  factored  version  of  Girsanov's  formula  for  Pqx. 

Theorem  2  There  is  a  random  variable  S,  defined  a.s.  Pq^— uniquely  for 
all  x,  z  (  IRn,  and  a  function  k  continuous  on  IR^n  such  that 

s  =  exp  Jo  Jo  (EV  ^(ft*  €Xt) "  Vft’  ext})  fdc(xt  K -  xt  ^ 


and  such  that  for  almost  all  z  (Lebesgue). 


a.s. 


where  J  •  •  •  denotes  a  Stratonovich  integral  (convention  of 

Rogers-Williams  (1987)),  the  summation  indices  /r,  v  range  over 
0,  1,  2,  •  •  - ,  n,  and  xP  =  t. 


Remark  1  The  integrand  of  the  double  integral  in  S  is  precisely  the  "exact 
2— form"  made  up  of  the  reciprocal  characteristics  given  by  Theorem  1. 
Notice  that  S  does  not  depend  on  x  and  z. 

2)  There  are  many  similar  expressions;  this  is  just  one  of  the  simplest.  The 
double  integral  is  given  by  a  Stokes  formula  on  the  fan— shaped  surface 
obtained  by  subtending  the  path  (t,  xy  0  <  t  <  1}  at  the  origin.  Any 

surface  stretched  between  the  path  and  a  piecewise  smooth  curve  beginning 
at  Xq  and  ending  at  x^,  and  depending  only  on  Xq  and  could  also  be 

used. 

8)  It  is  highly  likely  that  the  phrase  "for  almost  all  /."  can  Ik*  (hopped, 
but  I  have  not  proved  this. 


11 


Proof  Let  G  be  the  Girsanov  formula  for  d  •Vd  Pqx;  in  terms  of  the 
P-x,  i  =  1,  •  •  *,  n,  this  is 

fi  •  i  rl 

G  =  exp  (  P-  dx'  -  p.  a1J  p.  dt 

Jq  1  1  ^Jo  1  J 

=  /?j(t,  xt),  etc.,).  G  and  the  other  stochastic  integrals  that  we 

introduce  can  be  so  chosen  that  they  are  defined  a.s.  IPq^— uniquely  for  all 

x,  z  e  Rn.  Now  transform  G  into  Stratonovich  form  and  than  expand  the 
resulting  line  integral  by  a  stochastic  form  of  Stoke's  theorem.  These  are 
familiar  steps  in  stochastic  differential  geometry;  see  for  instance  Meyer 
(1982,  p.  202)  for  the  former  and  Bismut  (1981,  p.  208)  for  the  latter.  So 

fl  -ir1  •• 

G  =  exp[  p.  dx\-±  (  aJJ  D.  p.  +  p.  a1J  /?.)dt] 

Jo1  ^  z  Jq  J  1  1  J 

rl 

=  exp  P  dx^ 

J0  A*  1 

Now 

0  (<t,  fxt)  axj|  t=i  =  Jo  |o  txt)  +  t  D/^et,  (Xt)x^)dt  dx! “ 

Making  use  of  a  stochastic  Fubini's  theorem  (Ikeda  and  Watanabe 
19S1)  we  have 


and  on  combining  these  we  find 
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J0  k  (*■  xt'a<t = I0(V(’  “d  xf  ~  V0,  "oK> d< 

+ J0  Jo  DiA(rt’  £Xd  (xt  ^  -  *f  3x^£d< 

=  I(x0,x,)  +  J  (  =  log  G) 

On  rearranging  the  double  integral  we  find  S  =  ed.  For  all  A  e 
B  c  J?(IRn),  we  have  the  identities 


where  kQ  =  Pqx(1,z)/pqx(1,z).  Hence  for  almost  all  z,  and  all  A(5, 
P0x(A)  =  |AGdffT0xk0^’ 

and  so 

d!?1  z 

“JT T  =  G  k0('  -z>  =  s  ci(X°'  Xl)k0(x’z> 

Ox 

Note  that  x0  =  x  and  Xj  =  z  a.s.  O7^.  Setting  k(x.z)  =  e^X;Z^  k0(x,z) 
then  gives  us  the  result. 

Krener's  postulate  (1)  is  quite  subtly  defined  in  that  the 
conditioning  variables  slide,  as  the  limit  is  taken,  along  geodesics  of  the 
Riemannian  metric  a--..  The  following  formula  can  be  thought  of  ns  an 

integral  variation  of  (1)  conditioned  more  conventionally  on  the  members  of 
a  partially  nested  family  of  o  fields.  It  is  stated  without  proof  for  the  ense 
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with  constant  diffusion  matrix  a.  A  full  treatment  of  the  general  case, 
which  requires  the  introduction  of  the  stochastic  differential  geometric 
concepts  of  stochastic  development  and  parallel  translation,  will  be 
presented  elsewhere. 

It  is  convenient  to  introduce  the  following  definition.  Given  the 
set-up  (ft,  {^},  {#h,  IP)  we  shall  say  that  a  continuous  process  Z 

S  S  v 

is  a  free  motion  if  for  all  0  <  s  <  t  <  u  <  1 


E[Z 


_  u 
t  u 


t  rj  t 

s  ^s  u 


|  Z  I  $  “]  =  0 
s  u1  SJ 


If  we  set  r  =  t  —  h,  u  =  t  +  h  then  we  see  that  this  definition  essentially 
says  that  the  mean 'acceleration' of  Zfc  (conditioned  on  is  zero. 

(cf.  martingales;  for  these  the  predicted  'velocity'  is  zero)  For  example,  if 
X  is  a  Brownian  motion,  then  Z^  =  x^  +  k(xQ,  x^)t  is  a  free  motion.  If  a 

free  motion  Z  is  adapted  to  {#[}  then  it  is  a  continuous  semimartingale; 

I  have  not  explored  its  other  properties.  However,  in  the  following 
particular  case  the  martingale  part  of  the  free  motion  is  a  Brownian  motion. 
If  X  is  Gaussian  Z  can  be  given  a  more  precise  description  (see  Frezza, 
Krener,  Levy  1989). 

Theorem  3  With  the  definition  of  Theorem  1,  if  [a1^]  is  a  constant  matrix, 
then  the  process 


71  _  i  iJ 

Lt  t 


[(Dk  0.  -  Dj  /y&cjj  +  (D0  0.  -  D.  0O)  dr]  ds  (2) 


is  a  free  motion,  where  D ,0.  =  D./?-(r,  x  ),  etc.,  and  j,  k  are  summed 

k  j  k  J  1 

over  1,2,  Alternatively,  for  any  0<s<t<u<l 


iv  x. 


+  a'J 


t  i 

x 


s  s  u 


t  -  s  i 

X 


S  II 


.  ('V'i 


Djrfn).|r] 


vJI, 
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where  7gU(r)  =  ^ for  r  ^  * 

=  (u  7  OCt  ril  for  r  >  t. 
(U  -sj 


Remarks  1)  Following  Krener's  point  of  view,  we  can  formally  but  usefully 
rewrite  the  integral  equation  (2)  as  a  second  order  stochastic  differential 
equation. 

3%  =  -  Dj/y  3k*  dt  +  -  Dfo)  dt2  +  3%  (3) 

where  (P  etc.  is  to  be  thought  of  as  an  infinitesimal  central  second 
difference,  with  the  usual  proviso  that  (3)  is  no  more  than  a  mnemonic  for 
(2)- 

2)  A  comparison  with  Krener’s  postulate  (1)  suggests  that  the  coefficient  of 
<2xdt  in  (3)  should  be  g  in  (1)  and  that  of  dt^  should  be  f.  This  is 
certainly  so  for  Gaussian  X,  but  for  nonGaussian  processes  the  agreement 
has  still  to  be  verified. 
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The  Fractional  Representations  of  a  Class  of  Nonlinear  Systems 

Arthur  Kroner,  Yi  Zhu 
Department  of  Mathematics 
University  of  California  at  Davis 
Davis,  California  95616,  USA 

Abstract  Right  and  left  coprime  fractional  representations  are  shown  to  exist  for 
a  special  class  of  nonlinear  systems  which  have  both  controller  and  observer  forms. 
Furthermore,  a  generalized  Bezout  identity  is  given  for  this  class  of  nonlinear 
*  *♦. sterns.  . 

LIntroduction  The  purpose  of  this  paper  is  to  obtain  the  right  and  the  left  coprime  factorizations 
of  a  class  of  nonlinear  systems  which  have  both  controller  and  observer  forms  and  to  prove  that  a 
generalized  Bezout  identity  holds  for  this  class  of  nonlinear  systems. 

The  fractional  representation  method  of  linear  systems  can  be  traced  back  to  the  early  1970’s 
when  Rosenbrock[16]  used  transfer  matrix  to  study  multi-input/multi-output  systems.  Since  then, 
this  method  has  been  studied  intensively  by  many  researchers[4],  [7],  [1 1],  Desoer[3]  generalized 
the  general  concept  of  coprimeness  to  a  ring.  Recently,  Hammer  [9]  used  the  coprime  factorization 
approach  to  tackle  the  discrete  nonlinear  systems,  and  there  is  a  growing  interest  in  applying  the 
coprime  factorizations  approach  to  study  nonlinear  systems. 

In  this  paper,  we  use  the  state  space  descriptions  as  realizations  of  a  nonlinear  mapping  so  that 
we  can  work  on  both  state  space  and  input-output  mapping  just  as  one  does  in  the  linear  situation. 
Our  approach  is  based  on  the  nonlinear  normal  form  theory  of  Krener[13].  Using  the  controller 
form  of  a  nonlinear  system,  it  is  possible  to  design  a  nonlinear  feedback  controller.  The  given 
system  can  be  right  factorized  into  a  composition  of  a  stable  postprocessor  and  an  inverse  of  a 
stable  preprocessor.  The  right  coprimeness  concept  is  based  on  this  right  factorization.  If  we 
combine  the  postprocessor  and  the  preprocessor,  they  form  a  higher  dimensional  system.  The 
existence  of  a  stable  left  inverse  of  this  higher  order  system  is  our  definition  of  right  coprimeness. 
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This  follows  Hammer[9].  In  a  similar  way,  we  start  from  the  observer  form  of  a  nonlinear  system 
used  to  design  a  nonlinear  asymptotic  observer.  A  modification  of  this  gives  a  way  to  left  factorize 
the  given  system  into  a  composition  of  an  inverse  of  a  stable  postprocessor  and  a  stable 
preprocessor.  If  we  put  them  together  to  form  a  higher  dimensional  system,  the  existence  of  a 
stable  right  inverse  of  this  higher  order  system  is  our  definition  of  the  left  coprimeness.  We  shall 
give  the  rigorous  definitions  later  on.  It  turns  out  that  for  those  nonlinear  systems  which  have  both 
controller  and  observer  forms  the  development  of  right  coprime  factorizations  is  a  rather 
straightforward  generalization  of  the  linear  theory  but  the  development  of  left  coprime 
factorizations  differs  with  substantially  from  the  linear  situation.  The  nonlinear  right  and  left 
coprime  factorizations  satisfy  a  generalized  Bezout  identity. 

This  paper  is  organized  as  following.  We  first  talk  about  two  normal  fonms  of  nonlinear  sys¬ 
tems  briefly,  and  then  discuss  right  and  left  factorizations.  Finally,  the  generalized  Bezout  identity 
is  given. 

2.Normal  forms  This  work  is  based  on  nonlinear  normal  form  theory.  A  complete  discussion 
of  nonlinear  normal  form  theory  is  beyond  of  the  scope  of  this  paper.  We  only  describe  those 
aspects  of  controller  form  and  observer  form  which  we  are  going  to  use  in  this  paper.  A  more  de¬ 
tailed  discussion  can  be  found  in  Kroner's  paper[13]. 

Consider  the  following  nonlinear  system 


(2.1a) 

%  =  f(5)  +  g(5)u 

(2.1b) 

/ — \ 

ll 

>> 

(2.1c) 

4(0)  =  4° 

where  f:  Rn  — >  Rn,  g:  Rn  — >  Rmxn,  h:  Rn  — >  Rp  are  all  smooth(C°°)  functions.  Assume  that 
f(0)  =  0  and  h(0)  =  0.  We  seek  a  local  change  of  coordinates  x  =  x(£)  under  which  (2.1)  has  a 
simpler  form  in  some  neighborhood  of  the  norminal  £°=  0.  In  this  paper  we  shall  restrict  our 
attention  to  the  changes  of  coordinates  whose  Jacobian  at  the  £°=  0  is  the  identity 
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—  (0)  =  I. 

as 


Such  changes  of  coordinates  can  be  written 
(2.2)  S  =  x  +  «*x), 


where 


4*0)  =0,  ^(0)  =  0. 
ax 


They  leave  invariant  the  first  order  linear  approximation  to  (2.1) 

s2 


(2.3a) 

(2.3b) 

(2.3c) 

where 

(2.4) 


z  =  Az  +  Bu  +  0(z,u) 
y  =  Cz  +  O(z)2 
Z(0)=S(0) 


A  =  §(0),  B  =  g(0),  C  =  ^(0) 

as  as 


Controller  form 

We  say  system  (2.1)  admits  a  controller  form  if  there  is  a  change  of  coordinates  (2.2)  and  a 
controllable  pair  (A3)  such  that  (1)  can  be  transformed  into  the  following  form 
(2.5a)  x  =  Ax  +  Bu  +  Bac(x)  +  BPc(x)u 

(2.5b)  y  =  Cx  +  yc(x) 


where 

ac(x)  =  O(x)2,  (5c(x)  =  O(x),  yc(x)  =  O(x)2, 

and  A  and  B  are  constant  matrices  defined  in  (2.4).  ac(x),  (5c(x),  and  yc(x)  are  all  arbitrary  smooth 
matrix-valued  functions  of  dimension  mxl,  mxm,  and  pxl  respectively.  Furthermore  the  matrix  I 
+  Pc(x)  must  be  invertible  at  each  of  interest. 

Controller  form  is  useful  in  designing  nonlinear  feedback  controllers.  If  we  let 
u  =  u  +  ac(x)  +  pc(x)u, 

then  (2.5a)  becomes  a  linear  system 

x  =  Ax  +  Bu. 
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Since  (A,B)  is  a  controllable  pair  so  that  we  can  find  a  matrix  F  such  that  A+BF  is  stable.  If  we 
apply  the  feedback  u  =  Fx  +  v  or  equivalently 

(2.6)  u  =  [  I  +  Pc(x)]  *(  Fx  —  ac(x)  +  v ), 

then  the  closed-loop  system  is  a  stable  system 

(2.7)  x  =  ( A  +  BF)x  . 

The  nonlinear  feedback  control  law  (2.6)  is  used  to  cancel  the  nonlinearity  of  the  given  system. 
We  see  from  it  that  the  invertibility  of  I  +  J3c(x)  is  necessary  to  guarantee  the  existence  of  the  non¬ 
linear  feedback  (6). 

Observer  form 

The  system  (2.1)  is  said  to  admit  an  observer  form  if  there  is  a  change  of  coordinates  (2.2)  and 
an  observable  pair  (CA)  such  that  (1)  can  be  transformed  into  the  following  form 


(2.8a) 

x  =  Ax  +  Bu  +  a0(y)  +  (30(y)u 

(2.8b) 

y  =  y  +  Y0(y) 

(2.8c) 

y  =  Cx. 

where 

ao(x)  =  O(x)2,  PQ(x)  =  O(x),  yo(x)  =  O(x)2, 

and  A  and  C  are  constant  matrices  defined  in  (2.4).  ao(x),  (3c(x),  and  yo(x)  are  smooth  matrix-val¬ 
ued  functions  with  dimensions  nxl,  nxm,  and  pxl.  They  are  arbitrary  except  x  +  yo(x)  must  be  lo¬ 
cally  invertible  function.  Notice  that  the  change  of  coordinates  is  typically  different  from  the  one  in 
controller  form. 

Observer  form  is  useful  in  designing  asymptotic  observers.  Now  introduce  an  observer 

(2.9)  x  =  Ax  +  a0(y)  +  Pc(y)u  -  Hty  -  Cx). 

_  A 

If  we  denote  x  =  x  -  x,  then  the  error  satisfies 

(2.10)  x  =  ( A  +  HC)x  . 

Since  the  pair  (A,C)  is  observable,  we  can  choose  a  matrix  H  such  that  A+HC  is  a  stable  matrix, 
and  then  (9)  is  an  asymptotic  observer  of  (8). 


If  a  nonlinear  system  can  be  transformed  into  controller  form  and  observer  form  ( typically  in 
two  different  coordinate  systems ),  then  we  can  design  an  observer-based  controller  just  like  we  do 
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for  linear  systems. 

The  question  here  to  ask  is  that  under  what  conditions  the  nonlinear  system  (2.1)  can  be  trans¬ 
formed  into  controller  form  (5)  and  observer  form  (8),  and  for  what  conditions  the  pairs  (A,B)  and 
(C,A)  of  the  linearization  are  controllable  and  observable  pairs  respectively.  The  answer  is  the 
nonlinear  system  (1)  must  be  controllable  and  observable  in  the  nonlinear  sense  and  satisfy  certain 
additional  conditions.  Again  readers  who  are  interested  in  the  details  should  refer  to[13]. 

3.Right  fractional  description  Suppose  we  have  a  nonlinear  system  in  controller  normal 
form 

(3.1a)  xc  =  Axc  +  Bu  +  B(ac(xc)  +  (5c(xc)u) 

(3.1b)  y  =Cxc  +  yc(xc) 

(3.1c)  xc(0)  =  x° 

where  the  c-subscripts  indicate  coordinates  and  functions  associated  to  controller  normal  form.  We 
view  (3.1)  as  defining  an  input/output  map 
(3.2a)  G:  u(  )  i->  y(  ) 

from  functions  u(t)  to  y(t)  for  t  >  0.  We  seek  a  right  factorization  of  G 
G  =  NoM"1 

where  N  and  M  are  input/output  maps 
(3.2b)  M:  v(  )  i-»  u(  ) 

(3.2c)  N:  v(  )  i->  y(), 

M  is  invertible  and  o  denotes  composition.  Among  others,  Khargonekar  and  Sontag[12],  Doyle[5] 

Francis[6,7],  Sontag[12]  have  treated  such  factorization  of  linear  systems  and  Hammer[9]  has  dis¬ 
cussed  similar  ideas  for  nonlinear  discrete  systems.  We  shall  follow  these  authors,  particularly 
[5,7], 
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To  describe  the  input/output  maps  M  and  N  we  shall  use  a  state  space  realization.  In  particular 
we  define  M  to  be  the  input/output  map  of 
(3.3a)  4C  =  (A  +  BF )  4<.  +  Bv 

(3.3b)  ac(4c)  +  (I  +  Pc(4c)  )u  =  +  v 

(3.3c)  ^c(0)=0 

where  (3.3b)  defines  u  as  a  function  of  4C  and  v. 

We  consider  the  decomposition  N  =  GoM,  which  is  realized  by  the  2n  dimensional  system 

(3. 1,3.3)  described  in  4c,  xc  coordinates.  Let  e  =  xc  -  4<.  then 

(3.4)  i  =  Ae  +  B(-F^  -v  +  ac(xc)  +  ( I  +  pc(xc))(  I  +  Pe(yf'(  F^-a^+v)  ) 

If  e(t)  =  0  then  e(t)  =  0.  Since  e(0)  =  0  we  conclude  that  e(t)  =  0  for  t  >  0.  In  other  words,  the  re¬ 
alization  ( )  of  N  is  not  controllable  because  e(t)  is  unaffected  by  the  input  v(t). 

A  controllable  realization  of  N  is 


(3.5a) 

(^(A  +  BFJt.  +  Bv 

(3.5b) 

y  =CtK+yc(tK). 

(3.5c) 

© 

II 

S 

Hence  we  conclude  that  G  =  NoM-1  where  N  and  M  are  realized  by  (3.5)  and  (3.3).  Notice  that 
M  is  invertible  since  ( I  +  (ic)  *s  invertible  by  assumption. 

Notice  also  that  if  (A,B)  is  a  controllable  pair  then  we  can  choose  F  so  that  (3.3)  and  (3.5)  are 
stable  systems.  Hence  we  have  factored  G  over  the  ring  of  stable  nonlinear  systems.  We  are  being 
deliberately  vague  about  the  precise  definition  of  a  stable  nonlinear  system.  It  is  clear  that  (3.3, 3.5) 
are  "stable"  under  any  reasonable  definition. 

Of  course,  we  are  interested  in  coprime  factorizations  over  the  ring  of  stable  nonlinear  systems. 

Again  we  shall  not  try  to  make  this  concept  precise  but  following  Hammer{9]  and  others  we  shall 
say  that  G  =  NoM-1  is  a  coprime  factorization  if  there  exists  P,  the  input/outpui  map  of  a  stable 


system, 

(3.6a) 


P:  [“]  r*  w 
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such  that  the  composition 

(3.6b)  P°[n]:  vi^  [y]'^w 

is  the  identity,  w  =  v. 

The  input/output  map  of  can  be  realized  by  an  n  dimensional  system 

(3.7a)  =  (A  +  BF )  +  Bv 

(3.7b)  ac(4c)  +  ( I  +  Pc(^c)  >u  =  F^c  +  v 

(3.7c)  y  =  C4c  +  Yc(4c) 

(3.7d)  4c(0)  =  0. 

One  left  inverse  of  (3.7)  is  realized  by 

(3.8a)  zc  =  Azc  +  Bu  +  B(ac(zc)  +  pc(zc)u) 

(3.8b)  w  =  ac(zc)  +  ( I  +  |3c(zc)  )u  -  Fzc 

(3.8c)  z(0)  =  0. 

If  e  =  4c  ~  zc  then 

e  =  Ae  +  B(ac(4c)  -  ac(xc)  +  (  PC(Q  -  Pc(xc))u  ) 

If  e(t)  =  0  then  e(t)  =  0  and  since  e(0)  —  0  it  follows  that  e(t)  =0  for  all  t  £  0.  If  e(t)  =  ^(t)  -  zc(t)  = 
0  then  w(t)  =  v(t)  for  all  t  >  0  so  (3.8)  is  inverts  (3.7). 

However  we  do  not  know  that  (3.8)  is  stable.  To  insure  the  stability  of  (3.8),  we  must  add  to 
(3.8a)  an  extra  term.  This  term  must  stabilize  (3.8)  and  must  be  zero  when  4C  =  zc  so  that  (3.8) 

remains  a  left  inverse  of  (3.7).  How  do  we  find  such  a  term? 

Notice  that  the  dynamics  (3.8a)  is  the  same  as  the  dynamics  of  the  original  system  (3.1a)  and 
notice  that  the  other  y  of  (3.7)  does  not  appear  in  (3.8).  Perhaps  we  can  inject  y  into  (3.8a)  to 
stabilize  it?  This  is  more  or  less  equivalent  to  asking  whether  output  injection  can  be  used  to 
stabilize  the  original  system  (3.1).  This  is  always  possible  for  systems  in  observer  form,  hence  we 
assume  that  there  exists  a  change  of  coordinates 

(3-9)  Xc=Xo  +  <l)co(Xo) 

satisfying  (2.2)  transforming  (3.1)  into  the  observer  form 


(3. 10a)  x0  =  Ax0  +  Bu  +  a0(Cx0)  +  ^(Cx^u 

(3.10b)  y  =Cx0  +  y0(Cx0) 

(3.10c)  xo(0)  =  xo°. 

Suppose  we  consider  a  similar  change  of  coordinates  for  (3.8) 

(3.11)  zc  =  z„  +  <Uzo> 

to  obtain 

(3.12a)  z0  =  Azc  +  Bu  +  a0(Cz0)  +  (30(Czo)u 

(3-  12b)  w  =  ac(z0+  (j)co(z0))  +  ( I  +  Pc(z0+  (})co(z0))  )u  -  F(z0+  <{)co(z0)) 

(3.12c)  zo(0) =  0 

We  add  to  (3.12a)  the  term 

(3. 13a)  aQ(y)  -  ao(Cz0)  +  (pG(y)  -(50(Cz0))u  +  H(Czo  -  y ) 

on  the  right  hand  side  of  (3.14a)  to  obtain 

(3. 12aa)  z0  =  (A  +  HC)z0  +  Bu  +  0to(y)  +  pQ(y)u  -  Hy 

where  y  is  a  function  of  y  of  (3.7c)  defined  by 
(3.13b)  y  =  y  +  Y0(y)=  C£0  +  y0(C^) 

and  ^  is  the  state  of  (3.7)  in  observer  coordinates 
(3.13c)  4c  =  5o  +  <KU 

Notice  that  (3.13a)  is  zero  whenever  =  zo,  hence  the  input/output  map  P  of  the  (3.12aa,  b, 
c)  is  also  an  inverse  of  (3.7).  Also,  if  (C,A)  is  an  observable  pair  then  we  can  choose  H  so  that 
(3.12aa,  b,  c)  is  stable. 

Now  we  summarize  the  analysis  into  the  following  right  factorization  theorem. 

Theorem  If  the  nonlinear  system  (2.1)  admits  controller  form  (3.1)  and  observer  form 
(3.10),  then  there  exist  stable  mappings  M:  vi — >  u  and  N:  vi — >y,  where  M  is  invertible,  such 
that  the  input/output  mapping  G  defined  by  the  system  can  be  factorized  as  G=N°M  \  And  fur¬ 
thermore,  M  and  N  are  right  coprime,  that  is,  the  mapping  :  vi — >  [yj  has  a  stable  left  in¬ 
verse  P:  [y]  i — >v  such  that  the  composition  P°  is  identity  mapping. 
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4.Left  fractional  description  In  linear  system  theory,  observability  is  dual  to  controllabil¬ 
ity.  Unfortunately  the  analogy  is  not  so  straightforward  for  coprime  factorizations.  The  theory  of 
right  coprime  factorization  of  nonlinear  systems  is  very  similar  to  the  theory  for  linear  systems  but 
theory  of  left  coprime  factorizations  for  nonlinear  systems  has  some  substantial  differences  with 
the  linear  theory. 

We  start  with  a  system  in  observer  form  (3.10)  realizing  an  input/output  map  G.  We  define  an¬ 
other  input  output  map 
(4.1)  M:  — >w 

by 

(4-2a)  =  (A  +  HC)40-  Hy  +  aoft)  +  |30(y)u 

where  y  is  an  invertible  function  of  the  input  y  defined  by 
(4.2b)  y  =  y  +  Yc(y) 

and  the  output  is 

(4.2c)  w  =  -C^  +  y 

(4.2d)  ^(0)  =  0 

Consider  the  serial  connection  of  (3.10)  and  (4.2),  this  is  not  a  realization  of  the  MoG  but  it  is  a 
re2- ration  of  N  =  M°£q  J.  ( This  is  the  first  important  difference  with  the  linear  theory).  If  we 

define  C0  =  xo  “  ’  l^en  N  is  realized  by 

(4.3a)  C0  =  (A  +  HC)C0  +  Bu 

(4.3b)  w  =  CC0 

(4.3c)  C0(°)  =  0 

because  in  40,  xQ  coordinates  for  (3.10,  4.2)  only  the  are  observable  from  the  output  w.  We 
consider  N,  M  as  a  left  factorization  of  G,  although  it  is  really  a  left  factorization  of  ]  in  the 


sense  that 
(4.4) 


M”[r.]  =  n 
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Notice  that  we  cannot  compose  this  on  the  left  with  M~  since  M  is  not  invertible  as  a  map¬ 
ping  from  J  to  w. 

Perhaps  the  best  way  of  viewing  the  situation  is 

(4-5a)  [I  IsHcHs 

or 

rI_i  r 1  0  ]-1  rn 

<4-5b)  U-lo  mJ  °[nJ 

The  matrix  notation  is  somewhat  misleading  because  M  depends  on  both  u  and  y. 

In  any  case,  if  (C,A)  is  an  observable  pair  then  (4.2)  and  (4.3)  can  be  made  stable  by  proper 
choice  of  H.  In  particular,  the  nonlinearities  in  (4.2)  are  momeryless  functions  of  the  inputs  u  and 
y  hence  (4.2)  is  BIBO  stable. 

Next  we  address  the  input/output  map 
(4.6a)  [-N,  m]:  [y] ' — >w 

where  again  the  matrix  notation  is  somewhat  misleading  since  both  u  and  y  are  inputs  to  M,  i.e., 
(4.6b)  w=-N(u)  +  M  [“]. 

This  input/output  map  can  be  realized  by  an  n  dimensional  system 
(4.7a)  =  (A  +  HC)^o+  ao(y)  +  Bu  +  p0(y)u  -  Hy 

where  y  is  an  invertible  function  of  the  input  y  defined  by 
(4.7b)  y  =  y  +  Y0(  y) 

and  the  output  w  is  given  by 
(4.7c)  w  =-C^Q  +  y. 

We  wish  to  find  an  input, 'output  map  P  realized  by  a  stable  system  so  that  P  is  a  right  invertible  of 
[-N,  M  ]> 

(4.8a)  P  :  vi— >[y] 

(4.8b)  [-N,  m]oP  :  vl-»  w  =  v. 

We  start  by  constnicting  an  inverse  for  (4.7), 
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(4.9a)  zc  =  Azc  -  Hv  +  Bu  +  ao(y )  +  p0(y)u 

(4.9b)  y  =  Cz0  +  v 

(4.9c)  y  =  y  +  Y0(y) 

(4.9d)  u  —  ? 

(4.9e)  zo(0)  =  0 

We  leave  the  u,  which  also  appears  in  the  dynamics  (4.9a), unspecified  for  the  moment  Notice  that 
if  e  =  £0—  zQ  is  the  error  between  the  states  of  (4.7)  and  (4.22)  then  e  =  0  whenever  e  =  0.  Since 

e(0)  =  0  we  conclude  that  e(t)  =  0  for  all  t  >  0  and  so  by  (4.7c)  and  (4.9b)  we  have  w(t)  =  v(t).  In 
other  words  (4.9)  is  a  right  inverse  of  (4.7). 

What  about  stability  of  (4.9)?  We  would  like  to  choose  the  output  u  in  such  a  way  that  (4.9a)  is 
stable  in  some  sense.  If  we  ignore  the  -Hv  term  of  (4.9a)  this  looks  like  the  original  system  is  in 
observer  form.  This  is  not  exactly  true  because  y  is  defined  by  (4.9b)  by  a  change  of  coordinates 

(3.9).  If  we  apply  a  similar  change  of  coordinates  (3.1 1)  to  (4.9)  we  obtain 

•  di> 

(4. 10a)  zc  =  Azc  +  Bu  +  B(ac(zc)  +  pc(zc)u)  -  ( I  +  — ss(zo))(Hv  )+ 

9z0 

3<b 

( I  +  T^z0)){[a0(Cz0+  v  )  -  ao(Czo)]  +  [P0(Czo+  v  )  -  po(Cz0)]u} 

3z0 

Suppose  we  choose  an  F  such  that  (A  +  BF)  is  stable  and  define  u  by 
(4.9dd)  ac(zc)  +  (I  +  Pc(zc))u  =  Fzc. 

The  u  in  (4.9d)  is  chosen  in  (4.9dd). 

When  the  input  v  =  0,  (4.10a)  becomes 
(4.10b)  zc  =  (A  +  BF)zc. 

We  view  (4.10a),(4.10b),(4.9b)  and  (4.9c)  as  a  realization  of  P  :  vi — then  P  is  a  right  in¬ 
verse  of  [_N,  m] 


We  summarize  the  above  analysis  as  the  following  theorem. 


Theorem  If  nonlinear  system  (2.1)  admits  observer  and  controller  forms,  then  there  exist 


stable  mappings  M 


'u' 

:  p 

-V- 


->v  and  N:  ui — >v  such  that  M0^  * 


[hb  n 


Furthermore  M  and  N  are 


left  coprime  under  the  sense  that  the  left  combined  mapping  [-N,  M  ] :  [y] » — >v  has  a  right  stable 
[y]  suc^  ^at  *he  exposition  [-N,  M  J  °P  is  identity . 


inverse  P:  vi — >1 


S.Bezout  identity  We  conclude  by  noting  that  a  "nonlinear  Bezout  identity"  holds  for  the 
above.  In  other  words  beside  P  being  a  left  inverse  (3.6b)  for  and  P  a  right  inverse  (4.9b) 


for  [-N,  M 

| ,  it  is  also  true  that 

(5.1) 

[-N,  M  ]°[n  ]:  v  [| 

o 

ii 

* 

1 

— t 

and 

(5.2) 

P  o  P:  v  h  w  =  0. 

We  summarize  these  equations  by  (5.3)  in  the  following  theorem. 

Theorem  (nonlinear  Bezout  identity)  The  systems  defined  in  last  two  theorems 
[n]’P’  [  n,m]  ,  and  P  satisfy  the  following  Bezout  identity: 

■  M  i  r  I  0  * 

N  PHo  I  J 
where  ] ,  P,  M  ]  > p  have  realizations  as  discussed. 


(5-3) 


P 

-N,  M 


6.ConcIusion  We  have  briefly  described  an  approach  to  nonlinear  factorizations  based  on  non¬ 
linear  normal  forms.  The  research  is  just  the  beginning.  Many  concepts  are  not  rigorously  defined. 
We  hope  that  this  attempt  will  give  the  nonlinear  system  study  a  push  in  this  direction. 
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OBSERVATION  OF  A  RIGID  BODY  FROM  MEASUREMENT 
OF  A  PRINCIPLE  AXIS* 

Wei  Kang  and  Arthur  J.  Kroner 

Department  of  Mathematics 
University  of  California 
Davis,  CA95616 


The  spacecraft  attitude  control  has  been  studied  before,  see 
[3],  In  this  note,  we  give  a  method  of  observing  the  attitude  of  a 
freely  rotating  spacecraft  by  measuring  one  of  its  principal  axis. 
This  problem  is  solved  in  $2  and  §3.  In  §4,  a  method  of 
determining  the  angular  velocity  by  the  trajectory  of  one  of  its 
coordinates  is  given. 


In  the  following,  we  consider  a  freely  rotating  rigid  body 
with  no  extemel  torques  acting  on  it  Let  {ej, 62,63}  be  a  set  of 
orthonormal  axis  fixed  in  the  spacecraft,  with  the  origin  at  the 
center  of  each  axis  parallel  to  one  of  the  principal  axis.  A 

second  frame  ■( ^^2^3}  is  an  inertially  fixed  basis.  In  page  445 
of  [2],  Symon  describes  the  motion  of  a  freely  rotating  body  as 
follows.  Fix  an  ellipsoid  on  the  spacecraft,  which  can  be 
represented  as 

3  3 

t  XXi€i  I  2>i2  = 

i=l  i=l 

It  is  called  the  inertia  ellipsoid.  There  is  a  fixed  plane,  P,  which  is 
called  the  invariant  plane.  As  the  spacecraft  is  rotating  freely,  one 
can  imagine  that  the  inertia  ellipsoid  is  fixed  on  the  spacecraft  and 
it  is  rolling  on  the  invariant  plane  without  slipping  and  its  center  is 
fixed  at  the  origin,  see  figure  1. 

Suppose  r  is  the  vector  from  the  origin  to  the  point  of 
contact  between  the  ellipsoid  and  the  invariant  plane,  Then,  the 

angular  velocity  <0  satisfies 
o)  =  br 

where  b  is  a  constant.The  following  equations  describe  the 
evolution  of  the  spacecraft’s  attitude 

IjtOj  +  (I3  - 12)02(1)3  =0 

+  ^1 '  ^3)®i®3  =  0 
I3CO3  +  (I2  - 1  j  )co  j  c«>2  =  0 


sinycosG 

V  = — 1 - cot 

sinG  1 


cosycosG^ 

sin0 


is  the  angular  velocity.  The  inertia  tensor  is 


-Ij  0  0 
1=0  I2  0 
0  0  13 


Inertia  Ellipsoid 


t  i- 

1JT7 


Invariant  Plane 


Figure  1.  The  inertia  ellipsoid  rolls  on  the  invariant  plane 


Suppose  the  output  of  system  (1)  is  the  position  of  the 
third  principal  axis,  63(0,  in  inertial  coordinates.  We  consider  the 
observability  of  the  model,  Le,  whether  the  complete  motion  can 
be  determined  from  the  time  history  of  ejft)  in  inertial  coordinates 
and  the  equations  of  motion  (1).  Rewrite  the  system  as  follows: 

(Dj  =  0002(1)3 


d>2=p0)i(03 


(1)3  =  yci)id)2 


-sinyctgG  -cosyctgG  1 


The  inertial  coordinates  of  63(0  are  the  observation  and  are  given 
by 


6  =  cosy©!  -  sinv|/o)2 


V  1  -  cos26sin$~l  [” hj(6,$) 


?  -y  I  -  cos26cos<j>  =  ^2(6,$) 
cos  6  -  _h3(8,$). 


where  ($,  y,  G)  are  Euler  angles  (see  [3]), and 
3 

w  =  Z  “iei 

i=l 

*  Research  supported  in  part  by  AFOSR  85-0267 


ll-*2 


Theorem  l.The  system  (2)  is  observable  iff  a+(3  *  0  and 
(P+l)coj2  +  (a-l)©22  *  0- 


Proof:  To  prove  this  system  is  observable,  using  the 
method  of  [1],  we  need  to  find  the  dimension  of  the  distribution 
C(r)  generated  by  {dy  ,  Lpdy . LjHdy).  Because 

hj  =  ±  -\l  1  -  hj2  +  h22 
the  dimension  of  C(r)  can  be  determined  by: 

dhi.Lpdhj . Lpr**dhj 

dh2,  Lf  dh2 . .  LFr'1dh2 

But  LpKihj,  i=1^2,  are  complicated.  To  determine  the  dimension, 
we  make  a  change  of  coordinates  in  the  output  space.  Let 


3hj  dh] 
3<J>  30 

3h2  3h2 
3$  30 


Then,  det(A)  =  cos0sin0;  and  A  is  nonsingular  whenever  0  *  -j- 
.The  angle  0  depends  on  the  choice  of  the  inertially  fixed  basis. 
We  can  chose  suitable  basis  to  avoid  the  case  0  =  Tp  in  a  local 
neighborhood.Thercfore,  by  change  of  coordinates  in  the  output 
space  we  can  take  0  and  $  the  output.The  observability  of  (2)  is 
equivalent  to  the  observability  with  respect  to  the  output 


By  calculation,  we  know  that 

Lp0  =  ojjcosy  -  o^siny 

Lp<J>  =  (cojsiny  +  o^cosy)-^ 

Lp20  =  sinQcos0(Lp4>)2  -  o^sinOCLp^) 
+  ac^o^cosy  -  Ptuj  cugsiny 


Lp2«t>  =  ctg0(Lj4)(Lp0)  +  — -CLpeLpt})) 


Lp0  siny  cosy„ 

+  OM—1—  + - - 

Jsin0  sinG  sin0 


In  dLp2$  and  dLp20,  all  the  terms  containing  d0,  d$,  dLp$  or 

dLp6  can  be  cancelled,  the  dimension  of  C(3)  is  the  same  as  the 
nude  of  the  following  matrix: 


0  0  -tOjSy-tijjCy  -Cy  -Sy  0 

0  0  WjCy-WjSy  Sy  Cy  0 

0  0  -aoijOjSy-fkdj&^Cy  -pa^Sy  au-jCy  (a-l^Cy-^lJWjSy 
.0  0  ao^a^Cy-pujUjSy  pojjCy  ato-jSy  (a-lH^Sy-KP+IJojjCy 


where  Cy  =  cosy  and  Sy  ■  siny. 

The  determinant  of  this  matrix  is 

CO  +  l)®!2  +  (o  -  +  P) 

Therefore,  this  is  not  zero  iff  C(3)  has  full  dimension. 

Remark  1:  (P  +  l)oij2  +  (a  -  l)o>22  *  0  implies  that 
cojand  o^  can  not  be  zero  at  the  same  time.  If  tOj  =  =  0,  then 

the  spacecraft  turns  around  63,  the  output  is  a  constant  vector  and 
it  is  impossible  to  determine  ej,  e2  from  the  trajectory  of  63(0. 

Remark  2:  The  condition  a  +  p  *  0  implies  Ij  *  I2,  If  I  j  = 
I2,  then  the  spacecraft  is  symmetric  with  respect  to  63,  We  can  not 
tell  the  difference  between  ej  and  e2.  Moreover,  e j  and  c2  are  not 
uniquely  defined.  So  it  is  impossible  to  determine  the  position  of 
cl*e2- 


In  this  section,  we  assume  that  Ij  >  I2  >  I3  and  (1)3  *  0.  If 

CO3  =  0,  then  o>j  *  0  (see  [3]).  So,  from  the  similar  method  in  this 
section,  we  can  determine  the  attitude  by  measuring  e j . 

From  [3]and  the  introduction,  we  know  that  the  motion  of 
the  spacecraft  is  totally  determined  by  the  following  three 
constants. 

(1)  The  direction  of  L,  which  is  the  normal  vector  of  the 
invariant  plane  P. 

(2)  The  distance,  d,  from  the  origin  to  the  tangent  plane  P. 

(3)  The  energy  T. 

As  the  inertia  ellipsoid  rolls  on  P,  the  vector  63  turns 
around  the  axis  L.  Suppose  that  the  coordinate  of  *3(0  in  the 
inertially  fixed  basis  is  (x(t),  y(t),  z(t)),  it  is  expressed  by  Euler 
angles  in  (2).  Imagine  that  the  curve  described  by  €3(0  has  mass 
with  the  density  a  constant  1.  Then  L  is  a  vector  passing  through 
the  center  of  this  mass.  So  the  coordinates  of  L  in  the  inertially 
fixed  basis  are 

so  so  so 

Jx(s)ds  jy(s)ds  Jz(s)ds 

=  V ■  vV-L-V  <3» 

Where  s  is  the  length  of  the  curve  described  by  €3(0  at  time  t.  The 
number  s 0  is  the  length  of  the  smallest  closed  curve  described  by 
e3(t)  if  €3(0  moves  in  periodic,  If  it  is  not  periodic,  we  must  take 
the  limit  of  these  integrals  as  the  length  Sq  goes  to  «. 

To  find  d,  we  consider  L^.  Let's  take  the  inertia 
ellipsoid  as 


Ilrl2  +  I2r22  +  I3r32=  1  W 

So,  the  vector  L  is  parallel  to 

1  lrlel  +12r2e2  +  I3r3e3 
3 

where  r=  ^rjCj  is  the  vector  from  the  origin  to  the  point  of  contact 
i=l 

between  the  ellipsoid  and  the  invariant  plane.  Therefore 


.  s 


I'm 


1 1  -  A2  A2 

f  ▼  I 


f  3 

DiV 

i=l 


d  =  Lr  = 


'  3 
i=l 


Ilrl2  +  I2r22  +  I3r32 

VP” 


ei'L  = - ...  ,—r  ri  ¥  (6) 

The  function  ta-LI  has  its  maximum  value  iff  h^l  has  its  maximum 
value.From  the  first  three  equations  of  system  (2),  we  can  easily 
prove 


(022  0)32 


=  constant 


So  (o>2(t),o>3(t))  describes  an  ellipse.  In  [2],  it  was  proved  that  r  = 
b<B  ,  therefore  (^4-3)  is  also  on  an  ellipse.  The  function  I r$(t)l 
has  the  maximum  value  implies  T2=0 .  The  equation  (6)  implies 
that  Ir3(t)l  has  its  maximum  value  iff  IL^I  has  its  maximum 
value.  Denote  this  maximum  value  of  IL^I  by  A.  So  IL^NA 
implies  T2=0.  From  (4)  and  (5),  we  obtain 

Ilrl2  +  I3r32=  1  (?) 

L  =^~=====~_  (8) 

VIl2fl2  +  I32r32 
so,  L  is  in  the  ej,  €3  plane. 


Now,  we  try  to  determine  the  energy  T  by  the  frequency  of  ej. 
Suppose 

X  =  (xj(t),  x2(t)  x3(t)) 
is  the  solution  of 

XI  =  ax 2X3 
x2 =  Pxlx3 
x3  =  yxix2 

such  that  the  initial  condition  is  on  the  inertia  ellipsoid  and 


W“2 

Its  period  is  ag.  Consider 
0)j  =  Xxj(2.t) 

It  can  be  proved  that  00  j,  o>2  “d  w3  satisfy  the  first  three 
equations  in  (2).The  energy 

T  =  J  to  Iw  =  Xy<Ilxl2  +  I2x22  +  l3X32)=-r 


The  period  if  to  is  —  .  Suppose  the  period  of  0)3  is  a,  then 


2a  =  — =-~ 
X  Vzf 


8a2 


Le,  =  ±a/1  -  fLeaV^tVl  -  A2 


The  equation  (6)  and  (9)  imply 
rj^d-  ±Vl-A2 


Therefore 


r3I3d  =  A 


rl  =  ±  Ijd  • 


A 

r2*Iid 


Substitute  (10)  to  (7),  we  have 
1  -  A2  A2 

- 1 - >T  =»  I 

I2d2 

From  this,  we  obtain 


Here,  a  is  the  period  of  CO3,  which  is  unknown.  But  we  proved 
that 

e3’L  =  ^^d 
r3  =  b<03 

where  b  is  some  constant.  So,  a  is  also  the  period  of  e3'L. 

Therefore,  we  can  use  (3)  to  determine  L,  (11)  to 
determine  d  and  (12)  to  determine  T.  From  the  proof,  we  could 
see  that  the  center  of  the  curve  described  by  <3(0  is  L,  the 

amplitude  of  ^Ll  determines  d  and  the  frequency  of  lej-LI 
determines  the  energy  T. 


system: 


In  this  section,  we  study  the  observability  of  the  following 


(13) 


<Dj  =  a©  2CO3 
©2  =  p©,©, 

(13) 

©3=Y©j©2 

y  =  ©1 

This  is  a  subsystem  in  the  spacecraft  attitude  problem  which  is 
related  to  the  angular  velocity  and  the  energy.  In  this  section,  we 
assume  Ij  >  I2  >  I3  or  Ij  <  I2  <  I3. 

Theorem  2.  If  I3  *  I2  (a  *  0),  ©j  *  0,  ©2  +  ©39*0,  then 
system  (13)  is  observable. 

Proof:  The  following  relations  can  be  easily  proved. 

Lpd©l  =  a©3d©2  +  aa^doj^ 

Lp2d©j  =ap©32d©j  +2aP©j©3d©3 
+  <xv©22d©i  +  2a7©1©2d©2 

Therefore,  the  dimension  of  the  distribution  generated  by  d©j, 
Lpd© j ,  Lp2d©j  is  the  same  as  the  rank  of  the  matrix 


"10  0 

0  a©3  a©2 

_0  2ay©i©2  2ap©1©3 

Its  determinant  is 


2a2©1(P©32  -  7©22) 

Because  I  j  >  I2  >  I3  or  Ij  <  I2  <13,  we  know  that  p  and  y  have 
different  signs.  So,  the  distribution  has  dimension  3  whenever  a 
*  0,  ©j  *  0,  ©22  +  ©32  *  0.  The  theorem  follows. 

Remark:  In  the  remarks  after  theorem  1,  we  explained  why 
the  condition  Ij  *  I2  and  ©j2  +  ©22  *  0  arise.  This  can  also  be 
used  to  explain  the  condition  on  I2, 13,  ©2  and  ©3  in  theorem  2. 
The  condition  ©j  *  0  is  necessary.  From  the  following 
discussion,  we  can  see  that  if  ©j  =0,  then  ©2,  ©3  can  not  be 
estimated  by  y  =  ©  j . 

Theorem  3.  Under  the  same  hypotheses  as  theorem  2, 


©32  ©  j2 


Because  Ij  >  I2  >  I3  or  Ij  <  I2  <  I3,  the  constants  a  and  p  have 
different  signs,  the  constants  a  and  y  have  the  signs. 

©22  ©i2 

is  an  ellipse.  So  ( max(©j  },0)  is  on  the  ellipse.  So 


Cj  =  -  a-max{©j2) 


O 

©22  =  —  ©j2  -  p-ot-max{©j2) 


©32  ©j2 


is  a  hyperbola,  ©j  *  0  means  that  ©j2  takes  its  minimum  value  if 
©3  —  0.  So 

C2  =  -a-min{©j2) 

Therefore,  the  formulas  in  theorem  3  are  proved. 

In  the  following,  we  are  going  to  find  a  kind  of  change  of 
coordinates  so  that  (13)  can  be  transformed  to  observer  normal 
form,  i.e,  we  want  to  change  (13)  and  make  it  look  like 

x  =  Ax  +  f(y,u) 

y  =  Cx 

where  (C  A)  is  an  observable  pair. 

In  [1],  this  method  is  discussed  in  detail.  In  example  7.3 
of  [1],  the  author  proved  a  necessary  and  sufficient  condition  for  a 
system  like  (13)  to  be  transformed  to  observer  form.  Unfortuately, 
it  can  be  proved  that  system  (13)  docs  not  satisfy  this  condition. 
Therefore,  we  have  to  think  about  this  problem  from  another  point 
of  view. 

In  theorem  4,  we  will  find  a  family  of  changes  of 
coordinates  x  =  x(©,c)  such  that  for  each  output  trajectory,  there 
is  a  constant  c0  so  that  x  =  x(©,c0)  transforms  (13)  to  an 

observer  normal  form.  x(©,c)  and  the  observer  normal  forms  are 
continuous  with  respect  to  c. 

Theorem  4.  Under  the  same  hypotheses  as  theorem  2, 
we  define 


m^2  =  —  ©j2  -  ya-max{©i2) 
Proof:  From  (13),  we  have 

1  d©j2  1  d©22  1  d©32 


Therefore, 


©22  ©j2 


*1  =  <al 


X2  =  a©2©3 


X3  =  aP©j©32  +  ay©j©22 

-  3  (Py  +  P2  +  y^y3  -  ap-jcy 
c  =  -  a{max(y2)  +  min(y2)) 


Then  x(©(t),c)  satisfies 


*i  =  x2 


*2  “  x3 + f  <Pir + P2 + fty3  +  aPicy  07) 

X3  =  0 


Proof:  substitute  (16)  into  (17),  and  use  (14),  (IS). 

In  this  note,  we  assume  the  spacecraft  moves  freely.  An 
obvious  question  is,  how  to  observe  the  attitude  when  the  system 
has  nonzero  input  ?  This  is  an  important  open  question. 

Another  interesting  problem  is,  for  what  range  of  the 
output,  system  £13)  can  be  estimated  by  theorem  4  without 
changing  the  parameter  c. 

The  results  in  this  note  are  applicable  to  the  rigid  body 
problem,  but  most  recent  spacecraft  research  is  directed  towards 
large  flexible  space  structures  and  the  models  are  much  more 
complicated.  However,  the  rigid  dynamics  are  still  interesting  and 
important 
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